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PREFACE 


This  Technical  Report  forms  a continuation  of  the  scries  started  with  Techni- 
cal Report  No.  3,  Electronic  Structure  of  Atoms  and  Molecules,  and  Technical  Re- 
port No.  -4,  Electronic  Structure  of  Solids  I;  The  Energy  Band  Method.  As  was  men- 
tioned in  the  prefaces  of  those  reports,  it  forms  part  of  a series  of  advanced  lectures 
on  the  theory  of  molecules  and  solids,  in  which  .1  am  endeavoring  to  cover  the  main 
features  of  the  electronic  structure  of  matter.  !t  has  been  decided  that  it  is  m.ore 
practical  to  issue  this  material  a chapter  at  a time;  hence  there  are  still  further  re- 
ports to  come  in  the  series,  dealing  with  configuration  interaction  in  solids,  magnetic 
problems,  and  various  related  topics  which  depen.i  on  the  many-electron  aspect  of  the 
structure  of  solids.  This  report  and  the  preceding  one,  in  contrast,  deal  with  the  one- 
electron  problem. 

It  will  be  recognized  that  a great  deal  of  the  material  in  the  present  l eport  is 
new,  fiarticularly  the  methods  for  handling  perturbation  problemis,  in  term.s  of  Wan- 
nier  functions,  using  the  technique  of  difference  equations.  This  material  is  also  de- 
scribed very  briefly  in  the  Quarterly  Progress  Reports  of  October  15,  1953,  and  Jan- 
uary 15,  1954.  As  soon  as  possible,  it  will  be  written  up  for  publication,  so  that  it 
will  be  convenient  to  make  reference  to  it. 


John  C.  Slater 


Cambridge,  Mass. 
December,  1953 


CHAPTER  2 


THE  PERTURBED  PERIODIC  LATTICE 

In  Chapter  i*  we  have  tr  eated  the  problem  of  an  electron  moving  in  a pei  iodic 
potential  such  as  we  find  in  a crystal,  v/tsen  we  solve  the  problem  by  the  method  of 
self-consistent  field.  Later  we  shall  go  on  to  the  problem  of  setting  up  determinantal 
wave  functions  made  from  such  one-electron  orbitals,  and  of  the  configuration  inter- 
action between  such  determinantal  wave  functions.  Before  we  do  so,  however,  we 
shall  take  uo  the  one-electron  problem  in  a lattice  which  is  no  longer  perfectly  periodic, 
but  has  suffered  some  sort  of  perturbation.  This  perturbation  may  be  in  the  form  of 
an  impurity  atom  which  is  substituted  for  one  of  the  atoms  of  the  lattice;  it  may  be 
displacements  of  atomic  positions,  either  on  account  of  dislocations  or  other  lattice 
defects,  or  regular  displacements  such  as  would  be  produced  by  a sound  wave  travers- 
ing the  crystal.  Other  perturbations  are  present  at  the  boundaries  of  real  crystals, 
where  we  may  not  use  the  periodic  boundary  conditions  so  useful  in  the  interior  of  a 
crystal.  All  of  these  qiiestions  are  of  obvious  practical  importance,  in  such  problems 
as  the  impurity  atoms  in  semiconductors,  the  behavior  of  alloys,  the  effect  of  imper- 
fections or  thermal  agitation  on  the  scattering  of  electrons,  sui-face  states  on  the  sur- 
face of  a crystal,  the  effect  of  missing  or  interstitial  atoms,  and  many  other  problems. 
The  method  used  for  attacking  all  these  types  of  problems  can  be  worked  out  in  a gen- 
eral way,  and  we  shall  do  so  in  this  chapter,  later  extending  it  to  specific  applications. 

1.  The  Wannier  Function  Method  for  Perturbed  Lattices 

The  most  generally  useful  method  for  handling  the  perturbed  lattice  depends  on 
the  use  of  Wannier  functions.  We  assume  that  ve  start  with  an  unperturbed  periodic 
problena,  with  Hamiltonian  H^,  consisting  of  the  kinetic  energy  operator,  and  a per- 
iodic potential,  and  that  we  have  solved  this  problem  exactly,  so  that  we  know  its  en- 
ergy levels  and  wave  functions  as  functions  of  the  propagation  constant  k.  Our  prob- 
lem has  a Hamiltonian  H,  equal  to  + H,,  where  H,  is  a perturbative  poteiitiai.  In 
the  case  of  an  impurity  atom,  for  instance,  Hj  would  represent  the  difference  between 
the;  potential  around  the  imi)urity  atom  and  the  pei  iodic  potential  which  we  should  see 
if  the  impurity  atom  were  replaced  by  one  of  the  regular  atonis  of  the  lattice,  so  that 
it  would  be  vanishingly  small  except  in  the  immediate  neighborhood  of  the  impurity 
atom.  We  shall  now  expand  the  solutions  of  the  perturbed  proCiem  in  terms  of  the 
Wannier  functions  of  the  unperturbed  problem.  The  reason  why  the  Wannier  functior;. 
are  so  convenient  for  this  purpose  is  that  they  are  rather  localized  so  that  only  those 
located  near  the  impurity  atom  are  appreciably  affected  by  the  perturbation. 

We  .saw  in  Section  3,  Chapter  1.  that  corresponding  to  each  energy  band  we  can 

♦References  to  Chapter  1 refer  to  Technical  Report  No.  4,  Solid  -State  and  Molecular 
Theory  Group,  M.  I.  T . , entitled  Electronic  Theory  of  Solids:  The  Energy  Band 
Method  (Chapter  l),  .July  15,  1955^ 
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set  up  Wannier  functions,  one  centered  o’l  each  unit  cell,  such  that  the  Wannier  func- 
tions associated  with  different  unit  cells  are  orthogonal.  It  is  also  easy  to  prove, 
though  we  did  not  do  it,  that  the  Wannier  functions  associated  with  different  energy 
bands  are  orthogonal.  If  a (r  - R^)  represents  the  Wannier  function  corresponding  to 

th  J -r. 

the  n band,  localized  in  the  unit  cell  whose  vector  position  is  iij,  then  as  in  Eq.  (l.  32) 
of  Chapter  1 we  can  show  that 


H a_(r  - R ) = ^ 

J I M I j II  L 


u.  1) 


That  is,  (f  (it.  - R.)  is  the  matrix  component  o'  H between  the  Wannier  functions  at 
n I J o 

Rj  and  R^.  We  can  express  this  otherwise  by  multiplying  Eq.  (2.  l)  by  ap*(r  - P-^).  and 
integrating  over  the  periodic  volume  of  N unit  cells,  over  which  the  a's  are  normalized 
and  orthogonal.  Then  we  have  at  once 


Hr  - RJ  H„a„{r  - RJ  dv  = ^^(R.  - Rd. 


(2.  2) 


We  see  from  Eq.  (2.1)  that  has  no  nnatrix  components  between  Wannier  functions 
associated  with  different  energy  bands. 

We  can  now  set  up  a Schrodinger  equation  for  the  unperturbed  problem.  We  ex- 
press the  wave  function  in  the  form  X(n,  j)  U (R.)  a (r  - R ),  where  the  U's  are  coeffi- 

ti  3 t'l  j 

cients.  We  know  that  we  can  make  such  an  expansion,  for  we  mentioned  in  Chapter  1 
that  the  a's  form  a complete  orthogonal  set  of  functions,  and  any  function  can  be  ex- 
panded in  terms  of  them.  We  se;t  up  the  unperturbed  Schrodinger  equation  H^u  = Eu, 
wheie  u is  the  wave  function.  We  substitute  the  sum  of  Wannier  functions  in  this  equa- 
tion, multiply  by  the  conjugate  of  one  of  the  Wannier  functions,  and  integrate  over  the 
volume.  Using  (2.  1)  or  (2.  2),  we  obtain 

y(R.)  ^ (R.  - ^.)  U (R.)  = EU  (S.).  (2.3) 

^ ] n i j'  n y n i'  ' ' 


This  set  of  linear  equations  would,  of  course,  lead  to  a secular  equation  of  the  ordinary 
sort  for  the  energy  E.  We  can  solve  it  more  conveniently,  however,  by  assuming  that 
U (R.)  = exp(ik  • R.).  This  assumption  is  equivalent  to  building  up  a Bloch  sum  of  the 
Wannier  functions  in  the  n band,  just  as  in  Eq  (l.  28),  Chapter  1.  When  we  make 
this  substitution  in  Eq.  (2.  3),  we  find  that  all  our  equations  are  simultaneously  satis- 
fied, provided 

E = exp(- ik  ■ R^).  (2.4) 


- / - 
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Eq.  (2.  4)  is  identical  with  Eq.  (l.  33)  of  Chapter  1,  though  our  method  of  derivation  is 
a little  different.  Since  the  energy  E of  Eq.  (2.  4)  depends  on  k,  and  on  the  energy 
band  we  are  dealing  with,  we  should  have  a notation  indicating  these  two  quantities;  we 
• nay  denote  the  energy  by  the  symbol  E^(k). 

We  now  ask  what  effect  on  these  results  is  produced  by  the  perturbative  Hamil- 
tonian H,.  Let  the  matrix  componen;  of  H.  between  two  Wannier  functions,  in  the  n 

and  m bands,  and  in  the  unit  cells  at  R.  and  R.,  be 

1 J 


fa^r- 


a *(r  - R.)  H,a  (r  - R.)  av  = V (R.,  R.). 
n ' rim  j nm'  i j' 


Then  in  place  of  (Z-  3),  Schrodinger 's  equation  becomes 

y(m,  (R.  - R.)  6 + V (R.,R,)1  U (R.)  ^ EU  (R  ). 


(2.  5)' 


This  set  of  equations  provides  an  exact  solution  of  our  problem,  if  we  can  solve  them. 

In  some  important  cases,  as  we  shall  show  later,  an  exact  solution  is  possible.  More 
often  we  can  only  approximate  a solution.  Our  first  aim  will  be  to  look  into  the  quali- 
tative nature  of  the  problem,  so  as  to  understand  the  type  of  solution  to  be  expected. 
Specially  simple  cases  result  when  the  components  for  n f m and  for  R^  f R^  can 

be  neglected;  that  is,  when  the  perturbations  do  not  result  in  mixing  of  the  Wannier 
functions  of  different  bands,  or  different  unit  cells.  Unfortunately,  we  shall  see  later 
that  in  most  important  cases  this  simpdification  does  not  hold,  but  ther  e are  some  cases 
where  it  does.  In  case  it  does,  Eq.  (2.  5)  simplifies  in  that  W'e  can  omit  the  summation 
over  ni,  replacing  m by  n on  the  left  side.  We  are  dealing,  then,  only  with  the  quan- 
tities U^(Rj)  relating  to  a single  band. 

The  most  important  feature  of  our  problem  is  the  fact  that  U (R.),  though  really 
a set  of  coefficients,  nevertheless  has  much  of  the  character  of  a continuous  function 
U^(r),  whose  values  at  the  discrete  points  r = R.  give  tlie  U^(R.)'s.  For  the  unperturbed 
problem,  where  V is  zero,  we  have  seen  that  a solution  of  Schrodinger 's  equation  is 
given  by  assuming  that  U (R.)  = exp(ik  • ),  which  obviously  means  that  our  continuous 

function  is  the  plane  wave  exp(ik  • r),  whose  values  for  r = R.  give  the  U's.  But  such 
a plane  wave  is  the  solution  of  a Schrodinger  problem  for  a free  particle.  The  remark- 
able fact  that  we  find  is  that  in  many  cases,  in  the  presence  of  the  perturbation,  the 
continuous  function  U^(r ) has  the  char  acter  of  the  solution  of  the  Schrodinger  equation 
for  a particle  in  the  presence  only  of  the  perturbative  potential  H,.  In  other  words,  by 
using  the  Wannier  functions,  we  can  often  take  account  almost  automatically  of  the  f>er- 
iodic  potential  and  the  band  nature  of  the  .solution?;,  and  have  a problem  almost  as  sim- 
ple as  a free  electron  in  a perturbing  field.  Let  us  examine  the  extent  to  which  this  is 
true. 
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The  possibility  oi  making  this  identification  arises  when  the  function  U(r)  (we 

shall  drop  the  index  n,  for  simplicity,  since  here  we  are  dealing  only  with  a single 

band)  varies  so  slowly  from  one  unit  ceil  to  the  next  that  we  can  replace  the  differences 

in  Eq.  (2.  5)  by  derivatives.  Then  we  can  convert  Eq.  (2.  5)  from  a ilifference  equation 

into  a differential  equation,  which  can  often  take  a character  similar  to  Schrodinggr's 

equation  for  a free  particle  under  the  influence  of  the  perturbative  potential  H,.  An 

^ 1 ) ^ 

elegant  mathematical  method  for  proving  this  was  given  by  Wannier,  ' ' providing  the 
main  foundation  for  the  treatment  of  perturbed  lattices  which  goes  by  his  name.  The 
proof  starts  by  considering  the  energy  E (k  ),  or  E(k),  of  Eq.  (2.  4).  The  propagation 
vector  k,  in  this  expression,  can  be  related  to  a momentum -like  quantity  p,  by  the 
equation  k = Ziip/h.  That  is,  p is  the  momentum  of  a particle  whose  de  Broglie  wave 
length  would  be  such  that  the  propagation  constant  would  be  k.  Sometimes  p is  called 
the  pseudomomentum,  since  it  is  not  the  real  momentum  of  the  electron.  We  now  take 
E(k  ),  express  k in  terms  of  p.  and  then  convert  this  into  a differential  operator,  by 
applying  to  p the  usual  rule  used  in  setting  up  a Schrodinger  equation.  That  is,  we 
replace  p by  - ijftV,  or  replace  k by  - iV.  V/e  then  allow  this  to  operate  on  the  func- 
tion U(r),  and  shall  find  that  the  result  of  this  operation  helps  us  in  solving  Eq.  (2.  5). 

We  can  set  up  this  operator  in  -ither  of  two  ways.  In  the  first  place,  we  can 
use  Eq.  (2.  4),  and  replace  k where  it  appears  in  the  exponent  by  - iV  We  then  can 

use  a simple  property  of  such  an  operator.  Let  ns  consider  the  quantity  exp(-  ik  R__  ). 

12  2 X sx 

We  can  expand  this  by  power  sei  ies,  to  give  1 - ik  R - k R , ....  If  we  re- 

place  k,  by  - i8/9x,  according  to  our  prescription,  and  allow  the  result  to  operate  on 
the  function.  f(x),  the  l esidt  is  f(x)  - R 9f/  8x  + ~ K b if  bx  ...  = f(x  - R ) by 

^ X S X S X 

Taylor's  theorem.  We  thus  find  that  exp(-  ik  • R,)  U{?)  becomes  equivalent  to 
U(r  - Rj^).  when  the  exponential  is  converted  into  an  operator.  Using  (2.4),  then,  we 
see  that  the  quantity  E(iT)  U(r)  becomes  converted  into  ^n^^s^  U(iT  - R^).  when 

k is  converted  into  an  operator.  But  we  may  now  rewmite  the  first  term  of  Eq.  (2.  5) 
in  the  form  ^ U(?.  - R ),  by  letting  R be  equivalent  to  R - ft..  Thus  we 

see  that  we  can  write  the  first  term  of  (2.  5)  in  terms  of  our  operator. 

Our  simplification  of  the  method  holds  only  in  case  ^j)  is  a completely 

diagonal  matrix:  that,  is,  not  only  do  the  components  vanish  if  n f m,  but  also  they 
vanish  if  R.  ^ If  we  assume  that  this  is  the  case,  then  we  can  define  a continuous 
perturbation  function  V(r^),  whose  value  equals  V(Sj^,  Rj^)  when  r = ^..  In  that  case, 

(2.  5)  can  be  rewritten  in  the  form 

E(r)  U(r)  + V(r')  U(r“)  = EU(r’),  (2.6) 


^G.  H.  Wannier,  Phys.  Rev.  52,  19l  (1937);  see  also  J.  C.  Slater,  Phys.  Rev.  T^, 
1592  (1949). 
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whore  E(k)  is  to  be  interpreted  as  the  operator  which  we  have  described.  But  now  we 
can  approach  the  operator  E(k ) in  a different  way,  by  expanding  E in  power  series, 
instead  of  in  Fourier  series  as  in  Eq.  (2.  4).  Let  us  expand  about  a point  wliere  E, 
regarded  as  a function  of  k,  has  a miniinuna  or  maximum  value.  Around  such  a point 
we  have 


E(iT) 


'a^F 

(k  - k )^  + 

X xo 

2 

(k  - k )(k  - k ) t . 

X xo  y yo' 

ak  ^ 

9k  ak 

X 

o 

X y 

o 

_ 

(2  7) 


+ terms  in  third  and  higher  powers. 


In  making  such  an  expansion,  we  are  assuming  that  we  do  not  have  the  situation  de- 
scribed in  Eq.  (43),  Chapter  1,  in  which  we  had  degenerate  bands,  and  could  not  carrj' 
out  a power  series  expansion  in  k.  We  can  now  replace  k , by  - i8/3x,  etc.  , to  convert 
(2.  7)  into  an  operator.  We  find  that  U(r)  does  not  itself  vary  slowly  with  r,  but  that 
rather  we  can  set  up  an  expression 

U(r)  = exp(ik^  • r)  W(r  ),  (2.8) 


where  W(r  ) will  prove  to  vary  slowly.  We  find  immediately  that  for  instance  the  opera- 
tor formed  from  k^  - k^^,  operating  on  exp(ik^  ■ r)  W(r  ),  equals  exp(ik^  • r)(-  i9/9x) 
W(r  ),  and  similarly  with  any  powers  or  products  of  the  operators  like  k^  - Hence, 

substituting  (2.  8)  into  Eq.  (2.  6),  we  can  cancel  the  exponential,  and  are  left  with 


! 

'a^E 

a^W(r)  ^ 29^E 

■d^VJ(r)  ^ 

- 

9k  ^ 

X 

8x^  9k  ak 

o x y 

o 

+ terms  in  third  and  higher  derivatives  of  W(r  ) 
+ V{r)  W(r  ) = EW(r  ). 


(2  9) 


In  some  cases  W(r ) will  vary  slowly  enough  with  position  so  that  the  terms  in 
the  thi-  d and  higher  derivatives  can  be  neglected  in  comparison  with  the  second  deriva- 
tive terms  which  we  have  retained.  As  far  as  the  second  derivative  terms  are  con- 
cerned, we  can  perform  a rotation  of  axes  in  ordinary  space,  in  such  a way  as  to  re- 
move the  cross  derivatives;  we  may  as  well  assume  that  this  has  already  been  done, 

so  that  9 E/9k  9k  , etc.  , are  zero.  Then  we  can  rewrite  (2.  9)  in  the  form 
/ X y-  • ' ' 
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1/ 

I j 


/ 1 a^w 


.2'  I 


m dx 

X 


a^w 

9y" 


+ _L  £lw^  ,fE  + 

m dz‘^  ^ *- 


V(r)l 


W = EW, 


(2.  10) 


where 


= ^^/(a^E/ak^"),  etc. 


(2.  ! 


In  other  words,  in  case  all  cur  assumptions  are  justified,  we  have  converted  cur  dif- 
ference equations  (2.  5}  into  a form  which  is  like  Schrodinger 's  equation  for  W,  only 
with  different  effective  masses  for  motion  in  the  tlu  ee  coordinate  directions,  if  we 
are  dealing  with  a minimum  of  E(k),  so  liiat  the  second  derivatives  .are  positive,  the 
effective  mass  is  positive,  while  if  we  are  at  a maximum  of  E(k  ) the  effective  mass  is 
negative;  \n  such  a case  it  is  more  convenient  to  change  the  sign  of  Eq.  (2.  10),  treat 
the  effective  mass  as  being  defined  by  the  negative  of  (2.  11),  and  make  a compensating 
change  in  the  sign  of  the  potential  energy,  so  that  it  is  as  if  we  were  dealing  with  a 
positively  charged  particle  instead  of  a negatively  charged  electron,  which  then  would 
be  subjected  to  a force,  and  potential  energy,  which  would  be  the  negative  of  those 
acting  on  an  electron. 

The  simplest  case  which  we  can  have  is  that  in  which  m , m , m are  all 
equal,  in  which  case  (2.  10)  becomes  just  an  ordin  "y  Schrodinger  equation  for  a par- 
ticle of  mass  m in  a potential  V.  A go^.d  deal  of  d icussion  of  impurity  levels  in 
semiconductors  has  been  based  on  this  assumption,  plus  the  further  assumption  that 
V(i’)  is  a Coulomb  potential,  in  a material  of  dielectric  constant  equal  to  that  of  the 
semiconductor.  Then  the  problem  becomes  mathematically  like  that  of  hydrogen,  and 
we  can  get  solutions  for  hound  states  just  like  those  in  hydrogen.  We  easily  find  that 
the  linear  extension  pf  the  wave  functions  in  space  is  greater  than  in  hydrogen,  by  a 
factor  equal  to  the  dielectric  constant  of  the  material,  divided  by  the  ratio  of  the  ef- 
fective mass  of  tne  particle  Lo  the  true  mass  of  the  electron,  and  the  binding  energy 
is  smaller  than  in  hydrogen,  by  a factor  equal  to  the  square  of  the  dielectric  constant, 
divided  by  the  ratio  of  masses.  In  the  case  of  germanium,  for  instance,  with  a di- 
electric  constant  in  the  neighborhood  of  l6,  and  effective  masses  of  the  order  of  half 
the  electronic  masses,  this  leads  to  wave  functions  extended  over  a good  many  Ang- 
stroms, but  with  binding  energies  which  are  a very  small  fraction  of  a volt. 

These  widely  extended  wave  functions,  which  we  can  often  get  by  solution  of 
(2.  10),  allow  us  to  examine  the  justification  for  the  assumption  that  third  and  higher 
derivatives  can  be  neglected  in  the  expansion  (2.  9).  The  energy,  as  we  know  from 
(2.  4),  can  be  expanded  in  a Fourier  series,  and  ordinarily  the  first  few  terms  of  tiiis 
series  are  the  important  ones.  Thus  we  see  that  the  s.iccessive  derivatives  of  E with 
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respect  to  k will  be  in  the  ratio  of  powers  of  R , the  lattice  spacing.  If  the  function 

- ar  ^ 

W IS  falling  down  like  an  exponemia.T.  e , as  we  should  have  for  a hydrogen-like 
case,  successive  derivatives  will  be  in  the  ratio  of  powers  of  a,  inversely  propor- 
tional to  the  linear  dimensions  of  the  wave  function.  In  other  word.s,  we  rnay  expect 
the  terms  in  (2.  9)  in  successively  higher  derivatives  of  W to  fall  off  in  the  ratio  of 
the  lattice  spacing  to  the  linear  dimensions  of  the  wave  function.  If  the  wave  fnnntion 
extends  over  many  lattice  spacings,  as  in  the  case  of  gei niajiium,  we  may  expect  suc- 
cesjsive  terms  to  get  rapidly  smaller,  so  that  the  reduction  of  the  equation  to  the 
Schrodinger  form  (2.  10)  is  legitimate.  On  the  other  hand,  if  it  does  not  extend  out 
very  far,  as  is  the  more  usual  case  in  a material  of  smaller  dielectric  constant,  it  is 
not  legitimate  to  neglect  the  higher  derivatives.  In  such  a case  we  can  try  to  retain 
additional  terms  in  (2.  9),  but  it  is  much  better  and  more  straightforward  to  return  to 
the  difference  equation  (ji.  5),  and  try  to  solve  this  directly.  The  writer  suspects  that 
the  number  of  actual  cases  where  the  approach  by  means  of  the  differe  .ial  equation 
is  justified  is  very  limited,  and  that  in  most  cases  we  are  on  much  firmier  ground  to 
use  the  difference  equation.  However,  the  differential  equation  is  often  useful  'T  point- 
ing out  the  general  nature  of  the  solution  to  be  expected. 


2.  The  One -Dimensional  Case 

Actual  crystals  of  course  are  three-dimensional;  but  in  a corresponding  one 
dimensional  problem  it  is  easier  to  solve  our  difference  equations  exactly,  and  the  in- 
sight gained  in  this  way  can  help  us  to  under.stand  the  more  difficult  three -dimensional 
case.  Accordingly  we  shall  consider  in  this  section  the  special  case  of  Eq.  (2.  5) 
which  arises  when  we  have  a set  of  equally  spaced  atoms  along  a line  We  shall  limit 
ourselves  for  the  present  to  the  case  v.'here  only  one  band  need  be  considered,  though 
later  we  shall  take  up  the  interaction  between  different  bands.  We  can  then  simplify 
the  notation  of  (2.  5),  in  an  obvious  way,  and  write  it 


[<^(P  ■ <l)  V(p,  q)j  U(q)  = EU(p),  (2.  12) 

where  p is  the  index  indicating  the  atom  along  the  line.  To  show  the  general  nature 
of  the  problem,  we  shall  start  with  the  simplest  example.  This  is  the  one  in  which 
there  is  only  a nearest -neighbor  interaction  between  atoms,  and  where  the  perturbation 
is  found  only  on  a single  atom.  By  nearest -neighbor  interaction,  we  mean  that  the 
matrix  component  of  H^  between  two  Wannier  functions,  as  given  in  (2.  2),  vanishes  if 
the  Wannier  functions  are  located  on  two  atoms  more  distant  than  nearest  neighbors. 

That  is,  <o(s)  is  zero  if  s is  greater  than  1.  It  is  at  once  obvious  from  (2.  2)  that 
<^(-s)  = (^(s),  so  that  (^(-1)  = (g’(l),  and  we  have  only  two  non-vanishing  coefficients, 
<^(0)  and  «f(l).  By  stating  that  the  perturbation  is  located  on  only  a single  atom,  we 
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mean  that  '■'(p,  q)  = 0 unless  p,  q are  each  equal  to  the  index  of  the  impurity  atom, 
which  we  may  as  v/ell  take  to  be  zero. 

With  these  simple  assumptions,  Eq.  (Z.  IZ)  reduces  to 


[i’{0)  - e]  U(p)  + <^{!)  |u(p  + i)  + U(p  - l)j  = 0 if  p 0 
[<^(0)  + V(0)  - eJ  U(0)  + <f(l)  [u(l)  + U(-  i)j  = 0. 


(2.  13) 


These  equations  can  easily  be  solved  exactly,  for  the  type  of  boundary  conditions  we 
arc  interested  in.  Wc  try  a solution  expfikRn)  for  U(p),  where  k is  a propagation 
constant  as  before,  R is  the  interatomic  distance,  so  that  Rp  is  the  distance  to  an  atom 
p units  away.  Then  the  general  equation,  written  in  the  first  line  of  (Z.  13),  will  auto- 
matically be  satisfied,  .as  in  Section  1,  provided 


E = <^(0)  + 2 1^(1)  cos  kR. 


(Z.  14) 


The  second  equation  of  (2.  13),  however,  is  a special  case,  and  will  not  be  satisfied  by 
our  exponential  solution.  We  can  combine  exponentials  with  equal  positive  and  negative 
k's,  however,  or  what  is  the  same  thing,  use  a cosine  function  with  a phase  to  be  de- 
termined. That  is,  we  can  let  U(p)  = cos  (kRp  - a),  where  a is  a phase.  This  as- 
sumption will  satisfy  the  first  equation  of  (2.  13)  as  well  as  the  exponential. 


tieiore  consioering  tlie  second  equatic' 


. ..<•/  7 I 7 I 
» *-/» 
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solution,  determined  by  symmetry.  The  potential  is  symmetric  on  reflection  in  the 
impurity  atom.  Therefore  the  solutions  must  be  symmetric  or  antisymmetric,  or  must 
be  even  or  odd  functions  of  p.  If  they  are  odd  functions  of  p,  U(0)  = 0,  and  U(-  1)  = 

- D(l),  so  that  the  second  equation  of  (2.  13)  is  automatically  satisfied.  In  other  words, 
an  antisymmetric  solution  can  be  written  in  the  form  sin  kRp,  and  as  far  a.s  such  solu- 
tions are  concerned,  the  pei  turbation  has  no  effect  at  all.  We  need  then  consider  only 
the  symmetric  solutions.  We  might  at  first  think  that  these  should  be  written  simply 
cos  kRp,  but  this  is  wrong,  for  we  already  know  that  this  function  will  not  satisfy  the 
second  equation  of  (2.  13).  Instead,  we  must  let  U(p)  = cos  (kRp  - a)  when  0 p,  but 
define  U(-p)  = U(p).  This  will  give  a function  with  discontinuous  slope  at  p = 0;  but 
this  does  not  trouble  us,  for  we  are  interested  in  U(n)  only  for  integral  valu  e.s  of  p, 
and  the  continuity  of  the  function  does  not  concern  us. 

Let  us  then  assume  U(p)  is  as  given  above,  and  substitute  in  the  second  equa- 
tion of  (2.  l3).  We  have 

[^<^(0)  + V(0)  - <f(0)  - 2 ,^(1)  cos  kr]  cos  Q + 2(^‘(i)  cos  (kR  - a)  = 0.  (2.  15) 
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Wc  expand  cos  (kR  - a),  cancel  terms,  and  find 


V(0)  cos  a + 2«f  (l)  sin  kP.  sin  a = 0, 
tan  a = - V(0)/{2«?(i)  sin  kR). 


(2.  16) 


In  (2.  !6)  we  find  how  to  choose  a so  that  our  assumed  values  of  U(p)  will  form  a com- 
plete solution  of  Eq.  {2.  13).  We  thus  have  a general  solution  of  the  problem,  for 
arbitrary  energy  values.  The  form  of  (2.  14),  however,  leads  to  a real  k only  if  the 
energy  E is  between  (#’(0)  t 2^(1),  for  only  in  this  case  is  |cos  kR|  less  than  1. 

That  is,  for  any  energy  within  the  single  energy  band  described  by  our  single  Wannier 
function,  we  can  set  up  a sinusoidal  solution  of  this  type.  If  we  have  to  satisfy  a 
boundary  condition,  such  as  that  the  function  vanish  at  a fixed  boundary,  this  will  lead 
to  quantum  conditions  determining  discrete  energy  levels;  we  shall  take  up  such 
boundary  conditions  later. 

The  only  effect  of  the  pertui'bation  on  these  energy  levels  within  the  band,  as 
we  see,  is  to  determine  the  phase  of  the  wave  function.  It  is  interesting  to  note  that 
if  V(0)  is  very  large  numerically,  so  that  tan  a in  Eq.  (2..  i6)  has  large  magnitude,  then 
a approaches  tt/2,  so  that  U(p)  approaches  sin  kRp,  which  goes  to  zero  at  p = 0.  In 
other  words,  the  effect  of  a very  large  perturbation  is  to  make  the  wave  functions  of 
the  states  within  the  energy  band  avoid  the  perturbing  atom.  On  the  other  hand,  we 
shall  now  show  that  the  perturbation  introduces  another  bound  state,  which  tends  to  be 
concentrated  on  the  perturbing  atom,  ^ailing  off  exporentially  a.s  we  go  away  from  it. 

To  describe  such  a case,  clearly  our  exponential  type  of  solu+'cn  is  not  appropriate. 
This  bound  state,  we  shall  find,  lies  outside  the  limits  of  the  band,  so  that  cos  kR  in 
Eq.  (2.  14)  would  be  greater  numerically  than  unity,  and  k imaginary.  It  is  better, 
then,  to  start  from  the  beginning  by  assuming  a pure  exponential  function  for  U(p). 

Let  us  then  assume  that  U(p)  = exp  (-yRp)  for  positive  p,  and  again,  for  the  sym- 
metric state  which  alo.ne  comes  into  consideration,  that  U(-p)  = U(p).  We  could,  of 
course,  also  have  a solution  increasing  exponentially  with  p,  but  this  obviously  would 
be  inadmissible. 

II  we  substitute  our  solution  U(p)  = exp(-  yRp)  in  the  first  equation  of  (2.  i 3), 
we  find  at  once  that  we  have  a solution  if  the  energy  is  given  by 


E = <T(0)  + 2<f  (1)  cosh  yR,  (2.17) 

the  anailogue  of  (2.  14).  For  y = 0,  the  case  where  the  wave  function  falls  off  infinitely 
slowly  with  r,  the  energy  is  <^(0)  + 2 ^(l),  lying  at  the  top  of  the  band  if  cf  (l)  is  posi- 
tive, at  the  bottom  if  it  is  negative.  We  shall  find  that  the  bound  state  we  are  inter- 
ested in  becomes  detached  from  the  band  if  the  perturbative  energy  V(0)  has  the  correct 


-9- 


THE  PERTURBED  PERIODIC  LATTICE 


sign  to  push  the  energy  of  this  state  away  from  the  band:  that  is,  if  it  is  positive  if 
the  state  is  at  the  top  of  the  band,  or  negative  if  at  the  bottom.  The  case  we  are  in- 
terested in  is  then  that  where  V(0)  and  <^(l)  ha  m the  same  sign.  We  may  now  substi- 
tute our  expression  for  U(p),  and  (2.  17)  for  the  energy,  in  the  second  equation  of  (2.  13), 
and  we  find,  remembering  that  U(-  i)  = U(i), 


sinh  yR  = VjZ^{\). 


(2.  18) 


This  equation  determines  the  value  of  y,  showing  that  it  is  zero  for  = 0,  but  be- 
comes greater  without  limit  as  increases.  We  may  substitute  (2.  18)  in  (2.  17),  to 
obtain  an  explicit  formula  for  the  energy  of  the  bound  state: 


E 


«?(U)  + 2<^  (l  ).,^' 


.e  ( \ i ) ^ 


(2.  19) 


We  see  that,  for  small  values  of  V^/2  6?(l),  the  energy  departs  from  the  value  at  the 
band  edge  by  a small  quantity  of  the  second  order,  while  the  value  of  y is  small  of  the 
first  order. 

We  have  just  seen  that  a single  energy  level  pushes  away  from  the  energy  b.and 
under  the  action  of  the  perturbation  and  that  its  wave  function  falls  off  exponentially 
as  we  go  away  from  the  perturbing  atom.  As  becomes  very  large  compared  to  the 
band  width,  the  exponential  fall -of"  becomes  very  rapid,  though  it  is  small  for  small 
V^-  At  this  point  we  can  give  an  elementary  check  of  our  procedure  of  the  preceding 
section,  in  which  we  replaced  the  difference  equation  by  a differential  equation  of  the 
dchrodinger  type.  We  are  directed  to  start  with  the  energy  expression  (2.  14),  and  re- 
place k in  it  by  the  operator  - i d/dx.  We  tlien  use  this  operator  to  set  up  a Schro- 
dinger  equation  of  form  (2.  9).  In  our  case,  this  leads  to 


(^(0)  W'(x)  + 2<^(1)  jw'  + 


i R^ 
2; 


d^W 

dx^ 


1 d^W 

4-’  dx4 


J + V(x)  W(x)  = EW(x),  (2.  20) 


where  V(x)  is  the  perturbative  potential,  which  is  zero  except  when  x = 0,  so  that  ex- 
cept at  X = 0 we  can  omit  the  term  in  V.  The  simplified  Schrodinger  equation  corres- 
poTiding  to  (2.  11)  is  obtained  by  omitting  everything  beyond  the  second  derivative  in 
(2.  20).  Now  we  can,  in  this  simple  case,  solve  either  the  approximate  or  the  exact 
problem  by  assuming  W = exp(-  yx),  the  only  difference  being  in  the  relation  which  we 
find  between  y and  E.  If  we  substitute  this  expression  in  (2.  20),  we  find  a solution  of 
the  exact  equation  if  E = <f(0)  + 2(T(l)[l  + (vR)^/2.'  + (yR)^/4.'  + . . .j  = ^(O)  + 
2<^(l)  cosh  yR.  agreement  with  (2.  17).  If  on  the  contrary  we  neglect  fourth  and 
higher  derivatives,  we  have  E = <^(0)  + 2,^(1)  |'  + tyK)^'/^'j  - which  is  a satisfactory 
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approximation  only  if  (yR)^  is  small  in  compar  ison  to  unity.  This  is  the  case  in  which 
the  wave  function  exp(-  yx)  falls  off  by  only  a small  amount  in  the  distance  R,  which 
agr-ees  with  the  condition  for  the  applicability  of  the  second-order  differential  equation 
in  Section  1.  If  the  perturbation  is  greater,  so  that  the  wave  function  falls  off  more 
rapidly,  and  the  extra  level  is  further  from  the  band,  th*^  second -order  differential 
equation  is  inapplicable,  G.nd  we  must  use  the  difference  eouaiion,  or  the  di.ffei  ential 
equation  (2.  20)  of  infinite  order,  which  of  course  ’s  rigorously  equivalent  to  it. 

We  can  gain  additional  insight  into  our  problem  if  we  impose  definite  boundary 
conditions,  and  see  what  happens  to  each  of  the  energy  levels  and  wave  functions  as 
the  perturbation  increases  from  zero.  Thus,  let  us  impose  the  periodic  boundary 
condition  that  U(p  -c  2N)  = U(p),  where  N is  a definite  integer.  We  must  evidently 
have  perturbing  atoms  not  only  at  the  origin,  but  at  p = t 2N,  - 4N,  etc.  For  the 
antisymmetric  solutions  we  easily  see  that  our  functions  must  be  antisymmetric  with 
respect  to  the  point  p = N,  and  the  symmetric  solutions  are  symm.etric  about  this 
point.  For  the  antisymmetric  solutions  of  (2.  I 1),  we  may  set  U(p)  = sin  kRp,  and  the 
boundary  conditions  will  be  satisfied  if  kRN  = rni,  where  n is  an  integer.  We  thus 
have  solutions  for  kR  = n~/N  , where  in  the  usual  way  we  shall  find  independent  solu- 
tions only  for  the  N - i values  n = I,  2,  ...  N - 1;  the  values  n = 0,  hi  automatically 
give  nothing,  and  any  integral  values  of  n outside  this  range  repeat  values  of  U(p)  for 
each  atom  as  found  for  one  of  the  values  n = 1,  ...  N - 1. 

For  the  symmetric  solutions,  we  can  most  conveniently  proceed,  not  as  in  our 
earlier  discussion,  but  by  setting  U(p)  = cos  kR(p  - N),  which  will  automatically 
satisfy  the  boundary  condition  at  p = N.  The  energy  is  given  by  (2.  14),  as  before,  but 
now  in  place  of  (2.  17)  and  (2.  18)  we  find  easily 

- tan  kRN  sin  kR  = V(0)/2<^’(I).  (2.21) 

We  can  solve  this  equation  graphically,  by  plotting  the  left  side  as  a function  of  k,  and 
finding  the  k's  for  which  this  equals  V(0)/2(^(l).  A plot  of  this  function,  for  N = 6, 
as  a function  of  kR,  is  given  in  Fig.  1.  We  see  that  the  axis  of  abscissas  cuts  this 
curve  in  seven  places,  kR  = 0,  tt/N,  2ir/N,  . . . tt.  For  other  horizontal  lines,  where 
there  are  only  six  intersections,  it  is  easy  to  show  that  there  is  also  one  intersection 
corresponding  to  an  imaginary  value  of  k,  corresponding  to  tlie  bound  state  which  we 
have  been  discussing  in  the  preceding  paragraphs.  The  generalization  of  this  is  that 
there  are  N + 1 symmetric  states,  which  together  with  the  N - i antisymmetric  states 
give  2N  in  all.  This  is  the  number  which  we  should  expect  on  account  of  our  periodi- 
city. 

It  is  very  informing,  using  this  graphical  solution,  to  find  the  values  of  the 
N + 1 energy  values  of  the  symmetric  states,  as  functions  of  the  quantity  V(0)/2^(i)- 
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Energy  of  perturbed  energy  levels,  as  function  of  perturbative  energy. 


2.  THE  ONE -DIMENSIONAL  CASE 


These  values  are  plotted  in  Fig.  2;  we  show  also,  for  comparison,  the  values  of  the 
antisymmetric  energies,  which  are  not  affected  by  the  perturbation.  The  interesting 
thing  to  notice  is  the  way  in  which  each  of  the  energy  levels  within  the  band  is  dis- 
placed, by  the  perturbation,  only  far  enough  to  approach  the  midpoint  between  neighbor- 
ing antisymmetric  states,  with  the  exception  of  the  limiting  state,  which  becomes  de- 
tached from  the  band,  and  forms  the  separate  impurity  state  which  we  have  discussed 
earlier.  The  case  which  we  have  shown  is  that  of  positive  <^(l),  in  wiiich  case  it  is  the 
top  of  the  band  where  the  separate  l^vel  appears  for  positive  V(0).  We  have  show.n  the 
case  for  a small  N value;  but  if  N is  very  large,  it  is  clear  that  the  displacements  of 
all  levels  but  the  one  which  becomes  detached  from  the'  band  will  be  negligible. 

We  have  discussed  the  case  in  which  tne  discrete  level  becomes  detached  from 
the  band  at  k = 0.  That  is,  if  ^{l)  is  positive,  the  maximum  energy  of  the  band 
comes  at  k = 0,  and  a positive  value  of  V(0)  results  in  a level  rising  from  this  maxi- 
mum puiui.,  while  if  <o  (\)  and  V(G)  are  negative  a level  is  pushed  out  of  the  band  at  the 
energy  minimum,  which  again  comes  at  k = 0.  We  can  3.1so  have  the  case  where  the 
discrete  level  emerges  from  the  band  at  the  extreme  value  of  k,  equal  to  k = tr/R. 

Such  a case  arises  if  {l)  is  positive  and  V{0)  negative,  or  vice  versa.  This  follows 
formally  from  Eq.  (2.  21),  just  as  does  the  case  we  have  considered,  and  the  curves 
for  energy  form  the  part  of  Fig.  2 for  negative  values  of  V(0)/2^(l).  The  quantities 
U(p)  iiave  a different  form  in  this  case,  however,  as  we  should  deduce  from  Eq.  (2.  8) 
of  the  preceding  section.  This  case  corre.sponds  to  U(r)  = exp(ik^r)  W(r)  ri  Eq.  (2,  8), 
where  the  expression  exp(ik^r)  will  change  by  a factor  - 1 when  r increases  by  R.  In 
other  words,  the  solution  corresponding  to  a bound  state,  in  the  absence  of  the  bound- 
ary condition  that  U(p)  goes  to  zero  when  p = N,  will  here  have  the  form  L’{p)  = {-  1)^ 
exp(- yRp),  instead  of  the  value  exp(- yflp)  previously  encountered.  When  we  substi- 
tute this  value  in  Eq.  (2.  13),  we  find  that  we  have  a consistent  solution  for  our  pres- 
ent case  in  which  V(0)/2(^(l)  is  negative;  there  are  appropriate  changes  of  sign  in  the 
equations  which  v/e  obtain  as  substitutes  for  (2.  18)  and  (2.  19). 

We  have  noted  earlier  that  the  wave  .functions  of  the  states  in  the  band  become 
small  at  the  position  of  the  perturbing  atom,  as  the  perturbation  becomes  large.  We 
can  see  that  this  must  be  so  in  a very  fundamental  way.  Let  us  consider  the  case  of 
periodic  boundary  conditions.  We  are  building  up  wave  functions  as  linear  combina- 
tions of  the  2N  Wannier  functions  on  the  atoms  of  a fundamental  period.  These  Wan- 
nier  functions  are  an  orthonormal  set  of  functions,  as  are  the  final  linear  combina- 
tions, so  that  the  U(p)'s,  which  are  the  transformation  coefficients  from  one  set  of 
functions  to  the  other,  form  a unitary  matrix.  This  is  a matrix  with  2N  rows  and 
columns?  we  have  2N  energy  levels,  and  2N  p's.  If  we  now  form  the  sum  of  the  quan- 
tities U*{p)U(p),  for  a given  p value,  summed  over  the  2N  energy  levels,  we  know  on 
account  of  the  unitary  property  that  this  sum  must  be  unity.  Let  us  carry  out  this  sum 
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for  p = 0,  corresponding  to  the  impurity  atom.  F’or  the  separated  state,  with  large 
perturbative  potential  V(0),  v/e  know  that  the  wave  function  will  fall  off  as  exp(-  yF^p). 
where  y is  very  large,  so  that  i.ne  coefficients  U(p)  will  be  very  small  for  p / 0. 

Since  the  wave  functions  must  be  normalized,  we  must  have  the  sum  of  U>''(p)  U(p)  over 
p,  for  a given  energy  level,  equal  to  unity,  which  means  that  if  the  conti  ibutions  to 
the  sum  for  p / 0 are  very  small,  the  term  for  p = 0 must  be  very  neariy  equal  to 
unity.  vVe  now  return  Lo  our  statement  that  the  sum  of  U*(p)  U(p)  over  all  levels,  for 
a fixed  p,  must  equal  unity.  For  p = 0,  we  have  ju.st  .seen  that  the  contribution  from 
the  separated  state  is  almost  unity.  Therefore  the  contributions  from  all  other  levels 
must  be  very  small,  showing  that  with  a large  perturbation,  the  wave  functions  of  the 
states  in  the  band  avoid  the  perturbing  atom  almost  completely.  This  general  type  of 
IS  important,  for  it  can  be  used  in  more  complicated  cases,  where  we  cannot 
get  explicit  solutions. 

We  have  now  proceeded  about  as  far  as  possible  with  our  very  simple  one- 
dimensional  problem.  We  can  introduce  some  further  complications,  however,  and 
still  carry  through  an  exact  solution,  and  in  this  way  we  can  get  some  further  insight 
into  more  difficult  problems.  In  the  first  place,  we  can  include  second -nearest 
neighbor  interactions,  without  seriously  increasing  the  difficulty  of  the  problem. 

That  is,  in  Eq.  (2.  12),  we  can  include  a term  <^(2)  as  well  as  <^{l),  so  that  the  en- 
ergy expression,  similar  to  (2.  14),  becomes 

E = S (0)  + 2 (1)  cos  kR  + Z<^(Z)  cos  2kR.  (2.  22) 

With  suitable  choice  of  the  constants,  this  allows  us  to  have  the  energy  as  a function 


(D) 


Fig.  3 

Energy  vs.  kR,  for  second-nearest  neighbor  interaction. 
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of  kR  given  by  : iirve  of  th?  tyj’e  of  Fig.  3,  in  which  we  can  have  a minimum  or 
maximum  energy  coming  elsewhere  than  at  k = 0 or  tt/R.  Tiiis  corresponds  to  a 
minimum  or  m.aximum  energy  ccm.ing  sc:mcwhere  within  a Brillouin  ?one,  and  not  at 
a symineti'y  point,  in  a three-dimensional  pi'oLToni,  and  can  be  of  practical  impor- 
tance, for  there  is  some  reason  for  thinking  that  this  is  the  situation  actually  foumi 
in  germanium  in  the  conduction  band.  Let  us  consider  v/l.?t  happens  in  this  case  if 
we  introduce  a pt  rturbation  V(0),  like  that  of  Eq.  (2.  13). 

In  the  first  pla.ce,  we  shall  still  have  the  same  situation  as  before,  that  the 
solutions  arc  divided  into  antisymmetric  and  symmetric  functions,  and  that  the  energy 
of  the  antisymmetric  states  will  not  be  altered  by  the  perturbation.  We  also  have  tne 
pOh.sLbilily  of  setting  up  solutions  of  a typical  equation  of  type  (2.  13)  in  the  form 
exp(ikRp),  which  leads  at  once  to  the  energy  express'on  (2.  22).  Vve  see  now,  how- 
ever, that  we  have  four  solutions  for  k,  rather  than  the  two  (t  a single  value)  which 
we  had  in  the  earlier  case.  We  can  see  this  most  simply  from  Fig.  3.  If  we  cut  this 
curve  with  a horizontal  Ime  the  intersections  will  correspond  to  the  values  of  k for 
w'hich  we  get  solutions  of  our  difference  equation,  for  given  energy.  It  is  clear  that 
there  are  ranges  of  energy  for  which  there  are  four  intersections,  which  we  may  call 
t !<  I , t k^.  In  other  ranges  of  energy  some  of  these  intersections  become  imaginai'Y, 
but  it  can  still  be  shown  that  Eq.  (2.22)  has  four  different  solutions  for  k for  a given 
energy,  though  sometimes  the  k's  are  real,  sometimes  complex. 

Let  us  first  consider  the  case  where  the  k's  are  real.  The  euuattons  which 
we  must  satisfy,  similar  to  (2.  13),  include  the  typical  equation 

<f(0)  - eJu(p)  + <^{l)  [^U(p  + 1)  + U(p  - l)j  + cf(2)[u(p  + 2)  + U(p  - 2)]  =0  (2.23) 

which  is  satisfied  by  U(p)  = exp(ikRp),  provided  k satisfies  (2.  22).  This  equation 
holds  for  p 2.  The  additional  equations  are  different  depending  on  whether  we  ar-e 
dealing  with  antisymmetric  or  symmetric  solutions.  For  the  symmetric  case,  we 
have 


[.^(0)  + v(o)  - sj 

[,^(0)  - E IJ(1)  + 


U(0)  + 2<^’(1)U(1)  + 2^(2)  U{2)  = 0 
(1)  [u(2)  + U(0)j  + «?(2)  j^TJ{3)  + U(l)j 


0. 


(2.24) 


For  the  antisymmetric  case,  the  first  equation  of  the  form  of  (2.  24)  is  automatically 
satisfied,  and  the  second  reduces  to 


e]  U(l)  + <g’(l)U(2)  + ^(2)  [u(3)  - U(i)l 

j L j 


0. 


(2.  25) 
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It  is  instructive  to  examine  the  ?,T>ti ‘symmetric  case,  tliough  we  know  in  advance  that 
its  solution  is  as  in  the  unperlm'oed  case;  that  is,  we  know  that  there  are  two  independ- 
ent soimions,  U(p)  = sink.Rp  or  sin  k^Rp,  where  k^,  k,  are  the  two  independent 
solutions  of  (2.22).  Since  we  have  degeneracy,  with  two  independent  solutions  for  the 
same  energy,  any  linear  combination  of  these  two  functions  is  a solution  of  (2.  25),  as 
we  readily  verify.  We  can  see  that  there  must  be  two  independent  solutions  directly 
from  the  difference  equations.  From  (2.  25),  w'e  can  solve  fur  IJ(3)  in  terms  of  U(l) 
and  Tl(2)  (remembering  of  coarse  that  U(0)  = 0 for  the  antisymmetric  case)  From 
(2.  23)  we  can  then  solve  for  U(4)  in  terms  of  C(3),  U(2),  and  U(i),  which  means  that 
we  can  find  U(4)  in  terms  of  U(l)  and  U(2);  and  we  can  repeat  this  process  step  by 
step,  finding  each  further  value  of  U in  terms  of  U(l)  and  U(2),  writing  them  in  fact 
as  one  quantity  times  U(l)  plus  another  quantity  times  U(2).  We  can  get  a solution, 
then,  for  arbitrary  values  of  U(l)  and  U(2),  showing  that  there  must  be  two  arbitrary 
constants  in  the  solution.  This  is  merely  an  alternative  way  of  writing  the  solution 
as  a linear  combination  of  sin  k^Rp  and  sin  k^Rp. 

We  can  now  apply  the  same  sort  of  argument  to  the  symmetric  case,  to  find 
what  must  be  the  nature  of  the  solution.  From  the  first  equation  of  (2.  24),  we  can  find 
U(2)  as  a linear  combination  of  'J(O)  and  U(l),  for  a given  energy.  From  the  second 
equation  of  (2.  24),  we  can  find  U(3)  in  terms  of  U(0),  U(l),  and  U(2),  which  then  leads 
to  U(3)  as  a linear  combination  of  U(0)  and  U(l).  By  using  (2.  23)  we  can  extend  this 
process,  writing  each  of  the  U's  as  a linear  combination  of  U(0)  and  U(l),  so  that  again 
we  have  two  arbitrary  constants  in  the  solution.  It  is  nc  longer  trivial,  however,  to 
find  what  these  solutions  are.  I'he  complete  solution,  as  a matter  of  fact,  has  not 
been  worked  out,  the  algebra  being  involved,  but  we  can  state  the  general  nature  of  the 
solution.  We  can  write  U(p)  in  the  symmetric  case  as  a linear  combination  cos 
(kjRp  - Oj)  + cos  (k^Rp  - a^),  where  A^,  A^  are  amplitudes,  and  phases. 

Wc  can  suhstdute  this  solution  in  Eqs.  (2.  24),  which  then  must  be  sufficient,  since 
there  are  two  equations,  to  deleimine  two  of  the  four  arbitrary  constants,  which  we 
may  take  to  be  and  a^.  Within  limits,  we  can  find  a solution  for  any  pair  of  ampli- 

tudes A^,  A^-  It  is  clear,  however,  thai  this  cannot  be  done  for  completely  arbiti-ary 
amplitudes,  for  then  we  should  be  able  to  choose  one  of  the  amplitudes  to  be  zero, 
which  v/ould  imply  that  a single  function  cos  (k^Rp  - Uj)  or  cos  (k^Rp  - o-^)  could  fur- 
nish a solution.  This  is  clearly  impossible,  for  such  a function  is  a solution  of  the 
unperturbed  problem,  for  V(0)  = 0,  and  hence  cannot  satisfy  the  perturbed  problem. 
There  must  be,  then,  only  a limited  range  of  the  ratio  A^/A^,  not  extending  either  to 
zero  or  infinity,  over  which  our  conditions  can  be  satisfied  with  real  values  of  a;  out- 
side this  range  Oj  or  or  both  will  have  to  become  complex.  T!  is  range  will  be- 
come larger  as  V(0)  approaches  zero. 

We  can  then  take  tv/o  independent  solutions,  say  those  for  which  A^/Aj  has  its 
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maximum  and  minimum  allowable  valuers,  and  use  these  as  two  independent  functions, 
building  up  all  others  as  linear  combinations  of  these  two.  One  of  them  will  consist 
largely  of  the  function  cos  (kjHp  - a^),  with  a minimum  contribution  of  cos  (k^Kp  - a^), 
and  the  other  '’  ill  have  a maximum  contribution  of  cos  (k^Rp  - a^)  and  a minimum 
amount  of  cos  (k  Rp  - a,).  The  effect  of  the  perturbation,  in  other  words,  is  to  mix 
the  two  waves  corresponding  to  the  two  different  k value.s  together,  the  mixing  in- 
creasing as  Ihe  size  of  the  perturbation  increases.  Tt  is  as  if  the  wave  of  one  K value, 
when  Scattered  by  the  perturbing  atom,  scatters  contributions  of  waves  of  botli  k 
values  corresponding  to  the  same  energy. 

We  shall  see  later  that  there  is  a bound  state  neai’  the  perturbing  atom,  being 
concentrated  more  and  more  on  this  atom  as  the  perturbation  becomes  larger.  Thus 
we  can  conclude,  just  as  in  the  case  of  nearest-neighbor  interaction,  that  tne  wave 
functions  of  the  other  states  must  beconie  vanishingly  small  at  me  perturbing  atom, 
or  U(0)  must  approach  zero,  as  the  perturbation  becomes  infinite.  We  can  also,  as 
in  the  case  o:;'  tlie  neares.t  neighbor  interaction,  impose  boundary  conditions,  such  as 
making  U(p  + 2N)  = L'(p).  In  this  case,  as  before,  we  shall  find  N - 1 antisymmetric 
solutions.  These  will  come  when  kR  = mr/N,  just  as  in  the  earlier  case.  For  such 
a k value,  in  general  the  ott  er  k value  corresponding  to  the  same  energy  will  not 
satisfy  the  periodic  boundary  conditions,  so  that  the  wave  function  will  be  just  U(p)  ~ 
sin  kRp,  with  the  (N  - 1)  values  of  k given  above.  For  the  symmetric  solutions,  the 
situation  is  luoie  complicated.  But  we  expect  to  find,  as  before,  that  there  are  N + 1 
symmetric  solutions,  of  which  one  becomes  detached  from  the  band  as  a discrete 
state  under  the  action  of  the  perturbation. 

Our  main  interest  is  in  this  discrete  state,  and  we  can  discuss  it  in  a some- 
what different  way,  by  analogy  to  the  method  used  for  the  case  of  nearest -neighbor 
interaction.  Let  us  consider  what  happens  when  the  energy  E in  Eq.  (2.  22),  and  in 
Fig.  3,  decreases.  In  the  case  shown  by  (a).  Fig.  3,  a line  of  constant  energy  inter- 
sects the  curve  in  four  places;  this  is  the  case  which  we  have  been  discussing.  As 
E decreases,  however,  we  come  to  a point  where  kj  and  k^  become  equal  to  each 
other,  and  then  there  is  no  further  real  intersection.  Examination  of  the  form  of 
Eq.  (2.  22)  shows  that  at  this  point  the  values  of  k for  the  intersections  become  com- 
plex, the  four  solutions  of  (2.  22)  having  ihe  form  t k^  t iy,  where  and  y are  real, 
y increasing  as  the  energy  E decreases  below  the  lowest  point  on  the  curve  of  Fig.  3, 
or  below  the  bottom  of  the  energy  band.  The  quantities  U(p)  then  can  be  written  in  the 
form  k Rp  exp(t  vRp).  We  are  interested  only  in  a symmetric  sci'ution;  there  are 
no  antisymmetric  solutions  corresponding  to  bound  states.  Ob’.iously  we  wish  only  a 
solution  which  vanishes  exponentially  as  p becomes  large.  Thus  for  positive  p we 
may  take  U(p)  = cos  (k^Rp  - a)  exp(-  yRp).  This  form  of  expression  reminds  us  of 
Eq.  (2.  8),  where  we  had  an  energy  minimum  at  a point  corresponding  to  a propagation 
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vector  k^,  and  where  the  solution  was  exp(ik^r)  multiplied  by  a slowly  varying  function, 
which  in  our  case  corresponds  to  the  exponential  function.  If  now  we  substitute  our 
solution  in  Eq.  (2.  24),  we  find  tliat  by  rather  tedious  algebra  we  can  solve  for  k^,  and 
the  energy,  as  functions  7.  That  is,  we  fin^i  a bound  state  of  a given  energy,  and 
its  qualitative  behavior  is  as  in  tht;  case  of  nearest-neighbor  interaction:  as  the  per- 
turbative potential  V(0)  is  increased  numerically  (it  must  of  course  be  negative  to 
make  the  discrete  level  fall  below  the  bcttoin  of  the  band),  the  discrete  level  falls  in 
energy,  and  its  y increases,  so  tliat  the  wave  functions  falls  off  more  rapidly  as  we 
go  away  from  the  perturbing  atom.  In  other  words,  as  far  as  the  qualitative  situation 
is  concerned,  our  second  nearest -neighbor  interaction  has  not  changed  the  properties 
of  the  discrete  state.  Now,  howevei',  it  detaches  itself  from  the  minimum  of  the  en- 
ergy curve,  which  no  longer  lies  at  the  center  or  boundary  of  the  Brillouin  zone,  and 
as  a result  of  this  the  wave  function  U(p)  has  the  modulation  factor  cos  (k^^Hp  - a),  as 
well  as  falling  off  exponentially  with  p. 

We  have  now  considered  two  of  the  possible  four  cases  shown  in  Fig.  3:  the 

case  where  the  energy  is  suen  that  there  are  four  real  values  of  k,  and  thtt  where  the 
energy  falls  below  the  bott.om  of  the  band,  and  tliere  are  four  complex  values  of  k,  as 
shown  in  (a)  and  (b)  in  Fig.  3.  The  case  of  (c).  Fig.  3,  is  one  where  there  are  two 
real  k values,  and  two  pure  imaginary  values;  as  the  energy  rises  from  a value  like 
(a)  to  that  like  (c),  two  solutions  coalesce  at  k = 0,  at  the  energy  maximum  at  the  cen- 
ter, and  there  they  become  imaginary.  No  discrete  level  appears  at  this  point,  how- 
ever, even  with  a positive  value  of  V(0),  though  there  would  be  a discrete  level  if  an 
energy  like  (c)  lay  entirely  outsidi;  the  energy  band.  Instead,  if  we  call  the  real  value 
of  k by  the  symbol  kj,  the  imaginary  one  i\,  then  we  build  up  a solution  of  the  form  of 
a linear  combination  of  cos  (k,Hp  - a)  and  of  exp(  - yRp),  for  positive  p's.  Both  types 
of  terms  are  required  to  satisfy  Eqs.  (2.  24),  and  we  shall  find  solutions  of  this  general 
type,  corresponding  to  an  enhancement  of  the  wave  function  near  the  perturbing  atom, 
but  merging  into  an  ordinary  cosine -like  wave  function  at  a distance  from  this  atom. 
Finally,  in  the  case  (d),  where  ttie  energy  is  entirely  above  the  band,  we  find  two  dif- 
ferent vaii’es  of  y.  We  build  up  combinations  of  the  two  appropriate  exponentials,  fall- 
ing off  as  v'e  depart  from  the  perturbing  atom,  and  again  firiu,  for  positive  V(0),  an 
impurity  level  detaching  itself  from  the  top  of  the  energy  band,  becoming  further  sepa- 
rated as  V(0)  increases,  with  the  wave  function  falling  off  .more  and  more  rapidly  as 
the  perturbation  becomes  larger  and  larger. 

This  sketch  of  the  case  where  second-nearest-neighbor  inter  tictions  con  — 
sidered  shows  that  the  general  situation  is  similar  to  that  with  only  nearest-neighbor 
interactions.  >^/e  shall  now  consider  another  form  of  generalization  of  our  simple 
nearest-neighbor  ca.se.  This  comes  when  the  perturbation  V occui-s  on  more  than  ore 
atom  As  an  example,  we  may  suppose  tiiat  it  occurs  on  three  adjacent  atoms,  in  a 
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symmetrical  form.  That  is,  we  may  suppose  that  V(l)  is  different  from  zero,  as  well 
as  V{0),  but  that  V(-  1)  = V(l)  7'his  would  allow  us  to  simulate  a case  in  which,  for 
instance,  ihere  was  a screened  Coulomb  interaction  between  the  impurity  atom  and  its 
neighbors,  falling  off  rapidly  with  distance  (so  that  V(i)  would  be  numerically  smaller 
than  V(0)),  but  appreciable  at  the  nearest  neighbor  of  the  perturbing  atom.  We  find  by 
good  fortune  that  Eqs.  (2.  24)  are  not  appreciably  more  difficult  to  solve  in  this  case 
than  v/hen  V(l)  is  .absent.  Thus  if  we  have  second-nearest -.neighbor  interactions,  and 
also  perturbations  on  the  atoms  next  to  the  central  atom,  we  can  still  get  analytic  solu- 
tions of  the  problem  We  find  that  the  situation  is  altered  in  an  interesting  way;  we 
now  find  three  energy  levels  splitting  off  from  the  band  under  the  action  of  the  perturba- 
tion, rather  than  one  only.  As  the  perturbation  becomes  large,  we  may  describe  these 
states  as  follows:  one  of  them  has  U(l),  U(0),  and  U(-  1),  all  of  the  same  sign,  and  is 
a symmetric  solution;  another  is  a symmetric  solution  in  which  U(l)  and  U(-  l),  which 
are  equal  to  each  other,  have  the  opposite  sign  to  U(0);  and  the  third  is  antisymmetric, 
with  U(0)  = 0,  U(-  1)  = - U(l).  These  have  the  same  arrangement  formally,  then,  as 
the  three  modes  of  a string  carrying  three  weights.  We  may  generalize  this  result;  if 
large  energy  perturbations  are  applied  to  n adjacent  atomic  sites,  we  shall  have  n 
discrete  levels  splitting  off  from  the  band,  as  the  perturbations  increase  from  zero.  It 
is  only  in  the  special  case  where  the  perturbations  apply  to  only  one  atom  that  there  is 
only  one  discrete  level. 

If  there  are  n discrete  levels,  locu  'd  on  n adjacent  atoms,  widely  separated 
from  the  rest  of  the  Daiid  for  large  perturba  .ve  potentials,  then  an  argument  based  on 
the  unitary  nature  of  our  transformation  will  *ell  us,  just  as  in  the  simpler  cases,  that 
the  wave  functions  of  the  band  will  avoid  these  n atoms  for  large  perturbation.  In  the 
limit,  w'e  can  obviously  have  a splitting  of  the  band  into  two  sub-bands.  Eor  instance, 
if  n adjacent  atoms  have  a perturbative  term  V(0)  in  their  potential,  equal  on  all  of 
them,  and  then  the  ne  :t  n atoms  do  not  have  such  a term,  and  after  that  we  alternate 
between  perturbed  and  unperturbed  atoms,  we  shall  split  half  the  levels  away  from 
the  other  half,  and  shall  have  effectively  a band  split  in  the  middle.  Situations  such 
as  this  are  met  in  molecular  crystals. 

Sometimes,  instead  of  having  a situation  like  that  just  described,  where  all  the 
perturbed  atoms  have  potentials  differing  by  a large  amount  from  the  unperturbed  ones, 
we  have  quite  a different  situation,  of  which  the  Coulomb  potential  is  typical.  This  of 
coui  se  is  a three-dimensional  problem,  in  which  the  perturbing  potential  is  a Coulomb 
potential  centered  on  a perturbing  atom,  so  that  the  perturbative  potential  V,  though  it 
falls  off  fairly  rapidly  with  distance  from  the  perturbing  atom,  still  extends  out  with 
appreciable  amplitude  out  to  distant  atoms.  We  know  that  the  corresponding  hydrogen 
problem  has  an  infinite  number  of  discrete  energy  levels,  below  the  .'ontinuuin.  A 
one -dimensional  version  of  this  problem,  in  a similar  way,  will  net  show  a finite  num- 
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ber  of  discrete  levels  definitely  separated  from  a largely  unperturbed  band.  Rather, 
all  th>,  levels  Vvill  be  perturhr<"!  the  lowes+  onr»s  mn.st  (assuming  that  the  pei'turbation 
is  of  the  type  to  depress  the  discrete  levels),  but  o*her?^  more  or  less  a<  cording  to  a 
Rydberg  law.  We  might  think,  \n  such  a case,  that  since  all  the  states  have  a certain 
resemblance  to  bound  states,  the  charge  distribution  produced  v^hen  all  states  were  oc- 
cupien  would  tend  to  concentrate  around  the  impurity  atom  That  this  is  not  the  case 
is  seen  from  our  argument  in  terms  of  the  unitary  transformations:  if  all  states  in  the 

perturbed  band  are  occupied,  then  the  electrons  will  be  uniformly  distributed  around 
all  atoms,  the  rather  concentrated  discrete  state  wave  functions  being  balanced  by 
higher  energy  levels,  in  which  the  wave  functions  tend  to  avoid  the  perturbed  atom. 

3.  The  Two-  and  Three-Dimensional  Cases 

The  two-  and  three-dimensional  problems  are  very  much  more  difficult  than 
the  one-dimensional  case  which  we  have  just  been  taking  up.  The  difficulty  arises  from 
various  sources.  In  the  first  place,  we  have  the  same  soi  t of  problems  which  arise 
in  differential  equations  in  two  and  three  dimensions:  we  are  dealing  with  the  ana- 
logues of  partial  differential  equations,  which  do  not  have  simple  solutions  except  in 
cases  of  separability,  and  most  of  our  cases  do  not  correspond  to  separable  equations. 
Furthermore,  we  have  the  symmetry  properties  which  become  so  impoidant  in  two  and 
three  dimensions.  Not  only  must  our  solutions  of  the  pi  oblem  in  terms  of  Wannier 
functions  show  proper  group-theoretical  behavior;  we  must  consider  also  the  sym- 
metry properties  of  the  Wannier  functions  themselves.  This  introduces  us  to  a ques- 
tion which  was  not  emphasized  in  Chapter  1;  the  nature  of  the  Wannier  functions  in 
case  we  have  degenerate  bands,  such  as  the  p or  d bands.  We  shall  have  to  go  into 
such  questions  in  detail,  before  v'e  can  understand  the  three-dimensional  case.  In  our 
first  discussion,  however,  we  shall  simplify  things  by  not  considering  degenerate 
bands;  v/e  shall  deal  with  s-like  bands,  and  Wannier  functions  with  s-like  .symmetry, 
amd  shall  continue,  as  in  the  preceding  section,  to  assume  that  we  can  gel  a good 
representation  of  the  perturbed  problem  by  using  Wannier  functions  from  only  one 
band.  We  shall  remove  this  restriction  in  a later  section,  where  we  shall  take  up  the 
effect  of  overlapping  bands  on  the  nature  of  impurity  levels.  Furthermore,  in  the 
present  section,  we  shall  deal  mainly  with  the  case  where  the  perturbative  potential  is 
located  only  on  one  perturbing  atom,  as  in  the  larger  part  of  the  discussion  of  the  pre- 
ceding section.  In  this  way,  we  get  as  close  an  approach  as  v/e  can  to  a problem  of 
spherical  symmetry,  and  we  can  use  some  of  the  results  of  the  separation  of  variables 
in  spherical  coordinates  which  w-e  should  meet  in  the  correspond' ng  problem  of  a free 
particle. 

The  discussion  of  Section  1 was  of  course  general,  applying  to  the  two-  and 
three-dimensional  cases.  Hence  we  can  conclude  at  once  that  when  the  wave  function 
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varies  slowly  with  position,  v/e  car.  replace  our  difference  equation  by  a sec<ind -order 
differential  equation,  and  can  use  (Z.  10)  to  solve  our  problem.  Furthermore,  with  an 
energy  band  of  s-like  symmetry,  the  effective  masses  m^,  rn^,  m^  will  aii  be  equir.l. 
Hence  if  the  potential  V has  spherical  symmetry,  our  Schrodinger  problem  (2.  10)  will 
be  separable  in  spherical  coordinates,  and  we  can  really  solve  our  problem.  Let  us 
then  consider  this  soluble  case  in  .come  detail,  as  a guide  to  what  to  expect  in  more 
complicated  cases  where  an  exact  solution  is  impossible.  We  cuiisiuer  first  the  case- 
in which,  as  in  the  one-dimensional  pi-oblem,  V has  a diagonal  matrix,  having  no  com- 
ponents between  Wannier  functions  belonging  to  different  bands  or  on  different  atcm.s, 
and  v.'here  furthermore  the  diagonal  component  is  zero  except  on  one  perturbing  atom, 
v.'hich  we  may  take  to  be  at  the  origin. 

In  this  case,  the  approximate  Schrodinger  equation  (2.  iO)  has  V = 0 everywhere 
except  at  the  origin,  and  hence  is  the  equation  for  a free  particle.  The  solutions  which 
we  wish,  then,  must  be  spherical  harmonics  of  the  angle,  times  spherical  Bes.sel  func- 
tions of  the  radial  distance  out  from  the  perturbing  atonri.  We  must  choose  such  a solu- 
tion as  to  have  a suitable  singularity  at  the  origin,  so  that  if  we  take  the  values  of  this 
solution  at  the  various  atoms,  treat  these  values  as  the  quantities  U(R.),  and  substitute 
in  the  difference  equation  (2.  5),  we  shall  have  a solution  of  this  difference  equation 
even  at  the  origin.  From  our  knowledge  of  the  solutions  of  the  wave  equation  in  sphe 
cai  coordinates,  we  know  that  the  wave  functions  of  all  these  cases  except  for  i = 0, 
where  i is  the  azimuthal  quantum  number,  will  vanish  at  the  origin.  Since  the  per- 
turbation is  located  only  at  the  origin,  this  means  that  only  the  states  for  J = 0,  or  the 
s Siates,  will  be  perturbed^  this  is  analog-ous  to  the  situation  in  one  dimension,  '-vhere 
the  sym.metric  wave  functions  were  perturbed,  ihe  antisymmetric  ones  were  not.  Wc 
may  limit  our  discussion,  then,  to  s states,  independent  of  angle,  and  in  which  the 
corresponding  spherical  Bessel  function,  expressing  the  dependence  of  the  wave  func- 

tion  on  r,  has  the  form  of  — - — or  ^ — or  — -L~  JL  ' depending  on  circumstances. 

sin  Ivi* 

We  note  that  all  these  functions,  except  the  particular  combinations — or  , 

become  infinite  at  the  origin;  there  is  no  one  of  these  functions,  in  the  range  where 
they  vaiy  e.xponentially  with  distance,  which  simultaneously  goes  to  zero  at  infinity, 
and  is  finite  at  the  origin.  This  as  once  shows  us  that  the  behavior  of  our  function  at 
the  origin  is  more  complicated  than  in  one  dimension,  and  that  we  must  give  special 
attention  to  the  problem. 

We  can  understand  v.'hat  is  going  on  by  considering  the  sichrodinger  problem  of 
a free  particle  in  a spherical  square-well  potential.  That  is,  we  let  the  particle  move 
in  a potential  which  is  zero  except  within  a small  sphere  of  radius  R^,  where  it  equals 
- V^.  If  we  choose  this  sphere  to  be  of  atomic  dimensions,  then  this  problem  may 
serve  as  a sort  of  analogue  to  our  actual  case,  the  potential  well  at  the  oi  Igin  corres- 
ponding to  the  perturbation  V(0)  found  at  the  central  atom.  In  a similar  way,  in  the 


HF  PERTURBED  PERIODIC  LATTICE 


one -dimensional  case,  we  could  have  taken  a free  parnicle  moving  in  a field-iree  space, 
except  for  a square  well  where  the  potential  is  - located  in  the  immediate  neighbor- 
iiood  of  the  origin.  This  problem  shows  strong  analogies  to  the  one -dimensional  case 
which  we  have  taken  up  in  the  preceding  section.  For  any  value  of  even  a very 
small  one,  a bound  state  is  set  up.  Its  energy,  as  a function  of  V , shows  a behavior' 

similar  to  that  of  Eq.  (2.  19):  it  is  proportional  to  ' for  small  V^,  to  for  large 

V^,  and  the  wave  function  falls  off  exponemiaily  as  we  go  away  from  the  perturbation, 

with  a value  of  y similar  to  tliat  of  Eq.  (2.  18).  In  our  present  case,  hov/ever,  the 

situation  is  quite  different,  as  is  known  from  elementary  quantum  mechar’cs  We  find 
that  if  the  potential  well  is  not  deep  or  wide  enough,  there  is  no  bound  state  at  all. 

This  can  have  quite  profound  effects  on  the  natur  e of  the  impurity  .levels. 

This  'liffei’erice  between  the  one-  and  three-dimensional  cases  is  so  important 
that  we  must  examine  the  reasons  for  it  in  detail.  We  remember  that  Schrddinger's 
equation,  in  spherical  coordinates,  for  the  case  where  the  wave  function  is  independent 
of  angle,  can  be  written  in  the  form 

.2  ,2 

- — - — (ru)  + V(ru)  = E(ru),  (2.  26) 

SiT^m  dr^ 

where  u is  the  Scni  udinger  wave  function.  That  is,  if  V is  zero,  as  we  have  outside 
our  potential  well,  the  quantity  ru  satisfies  the  differential  equation  for  a free  particle, 
so  that  it  can  be  written  in  the  form  exp(ikr),  where  k = ^(2mE)^^^.  Inside  the  poten- 
tial  well,  in  a similar  way,  where  the  potential  energy  is  assumed  to  be  - V^,  we  have 
a solution  of  the  form  exp(ik  r),  where  k = — (2rn(E  + V For  a bound  state, 

we  must  assume  that  E ■+•  is  positive,  but  that  E,  the  kinetic  energy  outside  the 
potential  well,  is  negative.  Inside,  then,  we  must  use  a solution  ru  = sin  k^r;  for 
only  by  using  such  a solution  will  the  actual  wave  function  u be  finite  at  the  origin. 

We  must  use  outside  a solution  exp(-'yr),  'vvhere  y = — (-2mE)^^^,  the  energy  being 
negative.  These  two  functions  must  join  smoothly  at  the  boundary  of  the  well.  Now 
the  exponential  must  be  sloping  down  at  the  boundary  of  ihe  well,  or  in  the  limit  where 
y is  very  small  it  will  have  a horizontal  tangent.  Thus  we  cannot  join  the  functions 
smoothly  unless  the  sine  function  has  gone  thi  ough  at  least  a quarter  wave  length  with- 
in the  well,  so  .as  to  rise  to  its  first  maximum  at  the  boundary  of  the  weii.  In  this 
limit,  then,  '.where  E is  zero,  and  the  wave  length  within  the  well  is  h(2mV^) 

the  condition  for  the  existence  of  a bound  state  is  that  4R  should  be  at  least  a wave 

2 2 ^ 

length,  or  that  must  be  at  least  h /(32mR^  ). 

'foi'  a discussion,  see  for  instance  L.  Schitf,  Quantum  Mechanics,  (McGraw-Hill 
Hook  Company,  Inc.  , New  York)  1949,  p^  77T 
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Of  course,  we  can  have  smaller  values  of  than  this;  and  we  naturally  ask 
what  (lappens  to  the  wave  function  in  such  a case.  The  answer  is  clear  from  the  dis- 
cussion we  have  just  given:  the  wave  function  inside  the  well  joins  at  the  boundary  of 
the  v/ell  onto  a sinusoidal  function  outside,  cori  esponding  to  positive  energy.  If  v/e 
consiaer  a finite  volume  within  which  the  electrons  move,  freely  except  for  the  one 


potential  well  within  the  volume,  then  if  the  depth  of  the  well  is  zero,  w<^  shall  have 
discrete  energy  levels,  just  as  in  the  free-electron  model  of  a conductor.  The  lowest 
one  of  these  energy  levels  will  have  a very  small  but  finite  energy:  the  wave  length 
corresponding  to  its  wave  must  be  equal  to  twice  the  dimension  of  the  cavity,  or  some 
such  thing,  depending  on  the  details  of  the  boundary  conditions.  Now  let  the  depth  of 
the  potential  well  increase  from  zero.  All  the  wave  functions  and  energy  levels  will 
be  slov'ly  modified,  and  the  energy  of  the  lowest  level  will  be  gradually  reduced  to 
zero.  It  will  not  reach  zero,  however,  until  the  well  has  reached  its  critical  size. 


At  this  point,  the  wave  function  will  be  like 


sin  knr 


within  the  well,  the  sine  having  al- 


most exactly  zero  slope  at  the  boundary  of  the  w'cll;  outside  the  well,  the  wave  func- 
tion will  be  almost  exactly  l/r,  fitting  smoothly  onto  the  function  within  the  well,  and 
being  very  slightly  modified  for  very  large  values  of  r to  fit  whatever  boundary  con- 
ditions we  impose  around  the  boundary  of  the  cavity.  As  the  well  becomes  still  deeper, 
this  lowest  level  will  fall  below  the  ^alue  of  zero  corresponding  to  the  bottom  of  the 

band.  Closer  examination  shows  that  for  small  increases  of  V over  the  critical  value, 

2 ° 

the  energy  will  be  proportional  to  (V^  - . l>ut  at  sufficiently  great  V^,  the  en- 

The  wave  function  will  become  more  and  more  con- 


ergy  will  be  proportional  to  V^.  The  wave  function  will  become  more  and  more  con- 
centrated in  the  potential  well  as  V increases,  falling  off  mo<  e ami  jnore  rapidly  in 

o 

the  exponential  region  outside  the  ’.veil,  where  it  has  the  form  exp(-  yr)/r.  Finally, 
for  an  extremely  deep  well,  the  wave  function  will  be  negligible  outside  the  well,  and 
v/ill  become  identical  with  that  of  a particle  in  a well  with  infinitely  high  barriers  at 
the  boundary. 

This  sketch  of  the  situation  shows  the  general  nature  of  our  solution  for  the 
bound  state.  There  is  now  one  very  important  feature  ccni;8cted  ’.vith  it:  in  any  ca.se 
where  a bound  state  exists,  the  wave  function  changes  so  rapidly  with  position  that  we 
are  not  justified  in  replacing  the  difference  equation  by  a differential  equation.  We 
have  just  seen  that  the  minimum  depth  of  well  gives  a wave  function  proportional  to 
l/r  outside  the  well,  and  for  deeper  wells  it  falls  off  more  rapidly,  as  exp(-  yi  '/r. 

But  even  the  function  l/r  varies  so  rapidly  with  position  that  we  are  not  at  at)  justified 
in  replacing  differences  by  derivatives,  or,  what  amounts  to  the  same  thing,  disregard- 
ing fourth  and  higher  derivatives.  The  wave  function  decreases  by  a factor  of  2 in 
going  from  an  atom  which  is  a nearest  neighbor  of  that  at  the  origin,  to  an  atom  t'wice 
as  far  away.  Such  a variation  cannot  be  regarded  as  small,  by  any  stretch  of  the  word. 
We  thus  conclude  that  it  is  never  legitimate,  in  a three-dimensional  problem,  to  re- 
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place  the  difference  equation  by  a differ  euUal  equation,  in  the  problem  where  there  is 
a perturbation  at  only  one  point.  We  may  still  expect  the  discussion  which  we  have 
just  given  to  held  in  a qualitative  manner,  b”t  cannot  expeoi  it  to  t)e  quantitatively  cor- 
rect. We  shall  then  pass  on  to  the  differ!  nee  equation,  and  shall  show  that  in  fact  the 
qualitative  behavior  is  just  as  we  have  described,  but  that  fortunately  we  can  handle 
the  problem  quantitatively  as  well  as  qualitatively. 


Let  us  ti 


:ase 


simple  cubic  lattice  in  thr^e  dimensions,  with  onlj' 


nearest-neighbor  interactions.  Taking  only  a non-degenerate  band,  with  s-like  sym- 
metry, and  considering  only  the  Wannier  functions  of  this  band,  our  difference  equa- 
tions (?.■  'i)  then  become 


Fir/A\  T7.1tT#_  1 ' 1 ...  J TTIa 

\v/;  - C.J  o VP,  q,  1 / 1 V * / \P  ~ , M>  ^ tp 


T ’ i n n 
- \r-  » n 


+ ! , r)  + tl(r.,  q - L r) 
t U(p,  q,  r + 1)  t U(p,  q,  r - l)j  = 0,  unless  p = q = r = 0 

(1 

j<?(0)  + V{0)  - e]  U(0,  0,  0)  + &^(l)  [uu.  0,  0)  + U(-  1,0,0)  + U(0,  1,0)  + U(0,  -1,0) 

+ U(0,  0,  1)  + U(0,  0,  - l)j  =0 


(2.  27) 


Here  we  have  numbered  the  atoms,  by  Uie  three  integral  indices  p,  q,  r.  These  differ  - 
ence equations  are  exactly  equivalent  to  Eq.  (2.  13)  for  the  one -dimensional  case.  Our 
problem  has  the  symmetry  properties  of  the  cube,  and  our  solution  must  then  form  a 
basis  function  for  an  irreducible  representation  of  the  group.  We  know  that  we  are 
looking  for  a solution  which  is  an  even  function  of  p,  of  q,  and  of  r.  This  is  the  same 
type  of  symmetry  shown  by  the  s function  in  the  free  electron  problem;  only  here, 
since  we  do  not  have  spherical  symmetry,  we  must  not  expect  that  the  continuous  func- 
tion U(p,  q,  r),  whose  values  for  integral  values  of  p,  q,  r we  desire,  will  show  shperi- 
cal  symmetry. 

Our  difference  equations  (2.  27)  fortunately  can  be  solved,  tiiough  the  solution 
does  not  seem  to  appear  in  the  literature.  The  writer  is  indebted  to  Dr.  H.  C. 
Schweinler  for  pointing  out  that  an  analogous  problem  has  been  solved  by  McCrae  and 
Wdipple,  and  to  Dr.  G.  F.  Koster  for  showing  that  the  same  method  cf  solution  can 
be  used  for  the  present  problem.  The  problem  is  much  more  involved  than  in  one 
dimension.  We  cannot  make  any  direct  use  of  the  spherical  symmetry  of  the  differen- 
tial equation  which  approximately  represents  the  problem,  for  the  real  solution  is  not 
spherically  symmetrical,  and  there  is  in  fact  no  way  to  separate  variables.  We  can, 
however,  of  course  set  up  solutions,  in  this  case  an  infinite  number  of  them,  for  the 

^W.  H.  McCrea  and  F.  J.  W.  Whipple,  Proc.  Roy.  Soc.  (Ediriburgh)  tO,  281  (1940). 
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general  equation  of  (Z.  27),  and  the  procedure  used  is  to  w/rite  our  solutiti;’.  as  a sum  of 
this  infinite  '^umber  of  terms,  and  then  apply  suitable  limiting  conditions  to  take  care 
of  ihe  special  equation  coi  i'esporidijig  to  p = q = r = 0. 

The  method  of  solution  introduces  an  app.arent  lack  ol  symmetry  between  the 
three  directions  in  space:  one  axis,  which  we  shall  lake  to  be  the  r axis,  must  be 
treated  differently  from  the  others.  Though  the  solution  appears  unsyrn  .netrical,  this 
is  only  in  the  way  of  writing  it,  not  in  the  solution  itself.  Let  us  take  a function 
lUp,  q,  r)  = cos  kjHp  cos  k^Kq  exp(-  yRr).  If  we  substitute  this  function  in  the  first 
equation  of  (2.  27),  we  find  that  we  have  a solution  provided 

E = + 2 c^tl)  j^cos  kjR  t CCS  k^R  cosh  . (2.28) 

Since  k,  and  k^  can  take  on  arbitrary  values,  we  have  a double  infinity  of  solutions  of 
this  type,  for  a given  energy  value;  the  quantity  y is  determined  from  (2.  28)  in  terms 
of  kj,  k^,  and  E.  Now  let  us  make  up  a solution  as  a sum  of  an  inf.nite  number  of 
such  terms,  ail  cor  -esponding  to  the  same  E;  it  will  clearly  be  a solution  of  the  first 
equation  of  (2.  27),  and  we  have  merely  to  satisfy  the  second  equation.  Before  doing 
this,  however,  we  must  look  a little  more  closely  at  boundary  conditions  at  large  dis- 
tances. 

We  are  looking  specifically  for  the  bound  state.  Thus  for  positive  values  of  '' 
we  wish  to  use  the  exponential  exp(-  yRr),  which  will  go  to  zero  at  infinite  values  of 
r.  However,  for  negative  r we  must  have  a solution  which  is  symmetric  with  this, 
so  that  we  must  use  a similar  sum  of  terms  with  exp(yRr).  we  shall,  then,  have  a 
condition  of  joining  of  these  two  solutions  along  the  plane  ;•  = 0,  as  well  as  the  con- 
dition at  p = q = r = 0.  We  also  wish  the  function  to  go  to  zero  at  infinite  positive 
and  negative  values  of  p and  q.  To  accomplish  this,  we  shall  arrange  to  have  the 
solution  go  to  zero  on  the  planes  p - t N,  q = i N,  where  N is  a large  integer,  and 
eventually  we  shall  let  N become  hTinite.  We  can  accomplish  this  if  we  limit  kjR, 
and  k^R,  to  the  N values  ■n^/2N,  3ir/2N,  . . . (2N  - 1)tt/2N;  for  then  each  of  the  cosine 
functions  will  become  zero  at  the  limits.  We  can  then  write  our  solution  in  the  form 

U(p,q,  r)  = ^{kj,  k^)  c(k  j,  k,)  cos  kjRp  cos  k^Rq  exp(-  yRr),  (2.29) 

where  the  k's  are  to  go  over  the  values  just  stated,  where  the  c's  are  coefficients  to 
be  determined,  and  where  the  soluticr  written  above  applies  only  for  positive  r;  for 
negative  r,  we  use  the  exponential  e,<c;  'yRr).  The  quantity  y is  to  be  determined  in 
terms  of  k,  and  by  Eq.  (2.  28).  V e shall  now  apply  the  conditii.n  of  continuily  over 
the  plane  r = 0,  but  for  the  ease  here  p and  q are  not  boUi  equal  to  zero. 

Tc  do  this,  we  substitute  i i 29)  in  the  first  equation  of  (2.  27),  for  the  special 
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case  wliere  r = 0,  remembering  the  situation  for  negative  rU>.  We  find  at  once  that 
the  result,  when  we  make  use  of  {2.  28),  reduces  to  the  statement  that  the  expression 
(2.  29),  computed  for  r = - I,  must  equal  the  same  expression  computed  for  r = 1. 

In  other  words,  though  the  expansion  (2.  29)  is  correct  only  fc^  positive  r's,  this  shows 
that  it  must  still  converge  for  r = - i (so  long  as  we  do  nut  liave  p = o = 0),  and  must 
give  the  co’'rect  value  of  the  function  at  such  a point.  Wo  csui  then  wr  e this  condition 
in  the  form 


y(kj,k,)  c(k,,k,)  cos  k^Rp  cos  k^Rq  sinh  yR  = 0.  (2.  30) 

We  shall  now  shov/  that  ail  such  equations,  provided  p and  q are  not  both  zero,  can 
be  satisfied  by  the  simple  assumption 


UV''-  , 


1 / c;i  nh  y'  R . 


(2.  31) 


where  y is  defined  in  terms  of  kj  and  by  (2.  28). 

To  do  this,  we  note  that  in  case  (2.  31)  lioids,  the  summation  in  (2.  30)  becomes 
^(kj.k^)  cos  kjRp  cos  k.^Rq.  Such  a sum  is  immediately  showm  to  be  zero  on  account 
of  the  familiar  theorems  on  the  orthogonality  of  the  cosine  functions.  We  may  do  it 
dii  eutly  from  the  summation,  or  we  may  rernemher  that  we  are  going  to  pass  to  the 
limit  of  infinite  N,  and  convert  the  sum  into  an  integration.  If  we  introduce  two  con- 
tinuous variables,  Uj  and  q^.  such  that  = k.Rp,  = k^Hp,  t*"en  each  of  the  a's 
is  to  range  from  0 to  -n,  in  extremely  small  steps.  Thus  the  summation  over  k^  le- 
duces  to  bl  times  cos  Uj  daj  = 0,  and  similarly  for  the  summation  over  k^-  We 
thus  see  that  our  eSepression  (2.  29),  in  which  the  c's  are  given  by  (2.  31),  forms  an 
exact  solution  of  our  difference  equations  (2.  27)  everywhere,  except  that  -we  have  not 
yet  considered  the  point  p = q = r = U. 

If  we  substitute  our  solution  in  the  second  equation  of  (2.  27),  we  find 

^(k k2)( l/sinh  yR)  |^^(0)  + V(0)  - E + Z <f(\)  fcos  k^R  + cos  k^R  + exp(- yH)jj.  = 0 

^ (2.  3i 

If  we  substitute  for  E from  (2.  28),  we  have 


X(k,,k2)(l/sinh  yR)[v(C) 


2 i^(  1 ) sinh  yR 


0 


(2.  33) 


We  replace  the  sum.mations  over  kj  and  k^  by  integrations  over  Uj  and  as  before, 
and  write  sinh  yR  in  terms  of  E,  a,,  a,,  using  (.2,  28)  and  the  equation  cosh^x  = 1 + 

I ^ 

sinh  X,  Then  (2.  33)  becomes 
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E - <g-(G) 


cos  (J  ^ 


cos  u 


. / ( 


0. 


(2.  34) 


This  cari  be  put,  in  the*  form 


V(01  1 = 2r>  2?  (1), 


(2.  35) 


where 


I = 


i-./.  ( 


E - ^(0) 

2 ( i ) 


cos  a, 


- ‘1-' 


-02 

^ da.j  da^- 


(2,  .36) 


In  (2.  35)  we  have  an  equation  determining  the  energy  E as  a function  of  the  perturbative 
potential  V(0),  for  the  bound  state  whose  wave  function  is  given  in  (2.  29). 

Tile  integral  (2.  36)  has  not  yet  been  rigorously  evaluated,  and  probably  cannot 

be  carried  through  completely  in  terms  of  well  known  functions.  However,  it  is  not 

hard  lo  give  a sufficiently  accurate  approximation  io  it  for  our  present  purposes.  In 

E - ^(O) 

the  first  place,  we  note  that  unless  — ^ — , ■;  ' is  greater  than  3,  the  integral  will  di- 

2«f(l) 

verge,  and  we  tiave  no  solution  to  our  problem.  The  case  where  it  is  equal  to  3 is  easy 
to  interpret.  This  corresponds  to  the  case  where  E = <^'(0)  + 6 (l),  which  is  the 

value  of  the  energy  at  the  center  of  the  Brillouin  zone,  from  (2.  28).  If  <5’(l)  is  positive, 
thi.s  IS  the  top  of  the  energy  band,  and  in  this  case  we  find  a discrete  state  only  when 
E is  above  the  top  of  the  band;  when  S\i)  is  negative,  it  is  the  bottom  of  the  band,  and 
we  have  a discrete  state  only  below  the  bottom  of  the  band.  This  of  course  is  as  it 
should  be.  Since  the  integral  I by  its  definition  is  positive,  this  means  that  from 
(2.  35)  we  have  a solution  v.'hen  V(0)  has  the  same  sign  as  «5"(l),  just  as  in  the  one- 
dimensional  case.  We  shall  not  lake  up  the  opposite  case,  where  the  disci-ete  level 
appears  at  the  value  of  energy  corresponding  to  the  corner  of  the  Brillouin  zone  rather 
than  the  center,  but  it  can  be  handled  as  easily  as  in  one  dimension,  merely  by  pi  ting 
in  a change  of  sign  on  the  U's  associated  v.nth  adjacent  lattice  points. 

Let  the  value  of  the  integral  I when  E <^(0)  + 6<f(l)  be  called  I^.  The  writer 
has  made  a rough  numerical  integration  for  this  quantity,  and  it  comes  out  so  close  to 
11“^/ 2 that  one  suspects  that  perhaps  this  is  its  exact  value.  (McCrea  and  Whipple,  by 
a different  numerical  method,  find  the  value  5.  02,  as  compared  v”ith  /2  ^ 4.  94.  ) 

For  values  of  |e  - (^(0)J  /2<^’{l)  greater  than.  3,  the  value  leading  to  I^,  it  is  clear 
that  the  denominator  of  (2.  36)  is  increased,  and  I decreased.  Thus  in  such  a case, 
to  satisfy  (2.  35),  V(0)  must  be  increased.  In  other  words,  in  order  to  have  a discrete 
Slate,  V(0)  must  be  at  least  equal  to  2ir  (^{l)/I^,  which  is  approximately  4<^"(l),  using 
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our  approximate  value  of  I^.  That  is,  from  the  difference  equation  as  well  as  from 
the  differential  equation  we  have  shown  that  there  is  no  discrete  state  unless  the  per- 
turbative potential  is  greater  than  a critical  value.  Furthermore,  we  h.ave  been  able 
to  evaluate  this  critical  value,  for  the  simple  cubic  lattice  wiih  nearest -neighbor  in- 
teraction. The  total  width  of  the  energy  band  in  this  case  is  (since  each  of  the 

cosines  in  an  energy  expression  E - «^(0)  + Z S \i)  \coz  k^R  + roe  k^R  + cos 
of  the  foi'm  of  Eq.  (2.4)  can  go  from  - 1 to  1).  In  other  word.s,  the  quantity  V(0)  must 
be  equal  to  at  least  a third  of  the  width  of  the  band,  in  order  that  there  be  a discrete 
state. 

It  is  interest  ng,  though  probably  not  very  significant,  to  compare  the  condition 

for  the  critical  value  of  V(0)  which  we  find  here  with  the  corresponding  value  for  the 

square  potential  well  and  Schi  udinger 's  equation.  We  have  seen  that  in  that  case  we 

have  a discrete  state  provided  V , the  depth  of  the 'potential  well,  is  greater  than 
2 2 ^ 

h /(32mR^  ),  where  is  the  radius  of  the  well.  To  compare  these,  we  must  substi- 
tute for  the  effective  mass  from  Eq.  (?.  11):  if  the  energy  as  a function  of  k is  as 
given  above,  then  (2.  11)  tells  us  that  the  effective  mass  is  - )^*'/2  <^ ( 1 ) R“  (the  negative 
sign  because  it  is  positive  at  the  bottom  of  a band,  which  comes  at  k = 0 if  <^”(l)  is 
negative).  If  we  had  a Schrodinger  equation  with  this  effective  mass,  then,  we  find 
that  the  critical  value  of  is  (Tr^/4)(R^/R^^)  <^(l).  We  may  ask  what  value  R^,  the 
radius  of  the  square  well,  must  be,  in  order  that  this  critical  value  of  should  agree 
with  4(^’(l),  the  critical  value  of  V(0).  Clearly  R^  - (n/4)R,  so  that  R^  i.s  of  atomic 

dimensions,  R being  the  interatomic  distance  in  the  simple  cubic  lattice.  The  volume 

3 '13 

of  the  sphere  within  which  the  potential  is  is  (4-11/ 3)  R^  , which  equals  (r'/43)R  , 

or  approximately  2R  , or  twice  the  volume  per  atom  in  the  simple  cubic  lattice.  It 

seems  likely  that  the  solution  of  Schrodinger 's  equation  for  such  a potential  well  would 

give  a fairly  good  approximation  to  the  solution  of  the  difference  equation. 

Let  us  now  proceed  further  with  our  discussion  of  the  energy  as  a function  of 

discrete 
E - (g‘(0) 

7 jf>  ( \ \ 

CO  \ A / 

= 3 + a,  where  a will  be  a small  quantity.  Now  for  a = 0,  the  integrand  of  (2.  3o;  has 

an  integrable  singularity  at  = 0,  and  this  singularity  is  removed  if  a becomes 

greater  than  zero;  it  is  only  in  this  part  of  the  range  of  integration  that  a small  value 

of  a will  have  a significant  effect  on  the  integral.  In  this  region,  however , we  can 

expand  the  integrand  in  powers  of  the  a's,  and  convert  it  into  a form  which  we  can 

7 

handle  analytically.  If  wp  rpgard  a and  fne  a“'s  as  small  quantities  whose  squares 
and  products  can  be  neglected,  the  integrand  of  (2.  36)  reduces  to  (2a  + 

For  small  a's,  we  can  then  introduce  polar  coordinates  in  an  a,  --a,  space,  the  radius 

2 2 2 1.  ^ 

being  r,  so  that  r = “j  integral  can  then  be  expressed  in  polar  coordi- 

nates, where  we  only  integrate  over  the  first  quadrant;  that  is,  we  replace  da^  da^  by 


V(0).  If 


(0) 


is  only  slightly  greater  than  3,  so  that  the  energy  of  the 


level  is  only  slightly  outside  the  band,  we  can  make  an  expansion  of  I.  Let 
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(tt/2)  r dr,  and  our  integral  becomes  f .g-,  where  we  have  not  written  the 

upper  limit,  since  when  r becomes  large  our  approximation  is  no  longer  valid.  On 
the  other  hand,  when  r l-ecomes  large  the  effect  of  a becomes  negligible,  so  that  we 
need  merely  carry  out  th‘c  integral  .above  to  some  arbitrary  upper  limit,  with  and  with- 
out the  term  in  2a,  to  get  the  diffei’cnce  between  I and  I^.  We  can  carry  out  the 
integration  at  once  by  introducing  x -■=  2a  + r^,  dx  = 2r  dr,  and  find  that 


2a  X"/' 


fUa)’ 


which  holds  for  small  values  of  a.  If  we  substitute  in  (3.  35),  we  can  easily  transform 
to  the  form 


E = ^(0)  + 6^(1)  + f(V(0)  - ^(1)1 

TT'^.^'d)  ■-  \o 

= <f(0)  + 6<f(l)  + [v(0)  - 

16<^(1)  L -I 


z 

if  we  use  the  approximate  value  I — "tt  j 2..  d'Vms:  we  ^ee  that,  just  as  in  the  one- 
dimensional  case,  the  energy  goes  proportionally  to  the  square  of  Ihe  perturbative 
energy  for  small  displacements  of  the  discrete  level  from  the  continuum;  only  here  it 
is  the  increase  of  the  perturbative  energy  over  its  critical  value  which  is  raised  to  the 
second  power,  rather  than  the  perturbative  energy  itself. 

We  are  also  interested  in  the  behavior  for  large  values  of  V(0).  For  sufficiently 

independent  of  Oj  arid'  so  vlitx  t I approache.s  it"  times  this  quantity.  As  a matter 
of  fact,  if  we  expand  the  integrand  of  (2.  36)  in  power  series,  we  find  that  the  term  of 
next  smaller  order  of  magnitude  than  the  leading  one  involves  the  integral  of  cos  Oj  + 
cos  a^,  which  integrates  to  zero,  so  that  this  value  for  I is  quite  good.  Thus  we  find 
that  for  large  perturbations,  E = «^(0)  + V(0),  to  quite  a good  approximation,  show- 
ing the  linear  dependence  of  energy  on  the  perturbation. 

It  is  now  instructive  to  draw  the  energy  as  a function  of  ihe  perturbalion  V(0), 
using  these  two  approximations.  In  Fig.  4,  we  show  in  this  way  E as  a function  of 
V(0).  Clearly  it  is  easy  to  interpolate  between  the  two  limiting  curves,  and  get  a very 
good  estimate  as  to  the  behavior  of  the  actual  energy  level.  We  have  drawn  the  curve 
for  a positive  value  of  tf(l),  so  as  to  indicate  a discrete  level  above  the  bottom  of  a 
band,  produced  by  a repulsive  perturbation,  if  V(0)  is  positive.  Thus  this  agrees  with 
the  case  shown  in  Fig.  2 for  the  one -dimensional  case.  The  difference  between  the 


large  values  of 


- (g’(O) 


it  is  clear  that  the  integrand  of  {2.  36)  approaches 
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two  cases  is  clear:  the  discrete  state  in  F'ig,  4 does  not  begin  to  emerge  from  the 
continuum  until  V(0)  reaches  its  critical  value,  whereas  in  Fig.  2 it  emeiges  for  in- 
finitesimal v(o). 

Another  interesting  way  to  show  the  results  is  given  in  Fig.  5.  Here  we  show 
energy  as  a function  of  the  band  width  <^(l),  keeping  the  perturbative  potential  V(0) 
constant.  This  shows  what  would  happen  if  we  started  with  a lattice  of  separated 
aimns,  so  widely  separated  that  t.he  energy  levels  were  nox  broadened  into  a band.  At 
zero  band  width,  the  one  level  of  the  impurity  atom  would  lie  above  that  of  the  rest  of 
the  atoms  (if  V(0)  is  positive).  As  the  atoms  are  pushed  closer  together,  and  the  band 
broadens,  we  see  that  it  dues  not  begin  to  perturb  the  discrete  level  very  much  until 
the  band  is  so  broad  that  it  almost  touches  t»:e  discrete  level.  Then  it  repels  the  dis- 
crete level  upward,  so  that  the  discrete  level  remains  above  the  band  but  comes  closer 
and  closer  to  it,  until  finally  the  discrete  level  becomes  incorporated  into  the  band, 
at  the  point  where  the  total  band  width  is  about  three  times  V(0). 

With  this  discussion,  we  have  a fairly  complete  understanding  of  the  problem 
of  a single  perturbing  atom  in  a simple  cubic  lattice;  since  the  results  here  are  so 
close  to  those  for  the  continuum  theory,  we  may  assume  that  the  same  thing  will  hold 
without  qualitative  change  in  other  types  of  lattices.  Now  let  us  consider  the  change 
in  the  situation  if  the  perturbative  energy  is  felt  over  a number  of  atom.s,  instead  of 
just  on  one  atom.  This  problem  is  too  hard  to  attenipt  to  solve  analytically,  but  it  is 
not  hard  to  make  a good  guess  as  to  what  will  happen.  We  may  expect,  as  in  one  di- 
mension, that  there  will  be  a number  of  discrete  levels,  for  sufficiently  great  per- 
turbative potentials.  As  a general  rule,  the  discussion  which  we  have  just  given  makes 
it  fairly  plain  that  as  the  perturbative  potential  increases,  these  discrete  levels  may 
split  off  from  the  continuum  successively,  rather  than  all  at  once.  The  various  wave 
functions  can  have  different  types  of  symmetry.  1 hese  functions,  as  we  have  men- 
tioned earlier,  must  belong  to  the  various  irreducible  representations  of  the  point 
symmet’'y  group  of  the  cxyslal  with  respect  to  the  center  of  the  perturbation.  Thus, 
if  the  perturbative  potential  and  the  crystal  both  have  cubic  symmetry,  we  shall  have 
the  various  possible  symmetry  types  of  the  cubic  group,  one  being  s-like,  a three- 
fold degenerate  p-like  type,  and  so  on. 

The  particularly  interesting  problem  in  three  dimensions  is  the  Coulomb  poten- 
tial. The  reason  for  this  comes  from  its  occurrence  in  semiconductors.  We  are  not 
going  at  the  moment  to  examine  the  nature  of  the  perturbing  field  to  be  expected  in 
various  types  of  problems;  this  is  a problem  in  self-consistent  fields,  raibcn  than  of 
solving  a periodic  potential  problem,  and  we  wish  to  separate  these  two  parts  of  the 
question,  first  understanding  the  nature  of  the  solutions  of  the  one -electron  Schrodinger 
equation  before  we  build  up  determinantal  many -electron  wave  functions  and  apply  a 
condition  of  self-consistency.  However,  we  may  anticipate  enough  to  state  the  general 
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situation  encountered  when  an  impurity  atom  is  introduced  substitutionally  into  a 
crystal.  The  result  is  quite  different,  depending  on  whether  we  are  dealing  with  a 
good  conductor  like  a metal,  or  a poor  conductor  like  a semiconductor.  Let  us  as- 
sume that  the  impurity  atom  has  a nuclear  charge  greater  or  less  than  that  of  the 
crystal  by  a few  units.  If  there  were  no  shielding,  this  would  introduce  a perturba- 
tion of  the  Coulomb  type,  which  outside  the  impurity  atom  itself  would  be  proportional 
to  the  difference  betv/een  the  nuclear  charges  of  the  impurity  atom  and  the  atoms  of 
the  crystal.  In  an  actual  case,  most  of  this  field  will  be  shielded  by  a rearrangement 
of  electronic  charge.  There  will  usually  be  enough  discrete  levels  with  wave  func- 
tions concentrated  close  to  the  impurity  atom  so  that  these  levels  can  be  occupied  by 
electrons,  or  emptied,  as  the  case  may  be,  to  make  the  impurity  atom  approximately 
uncharged.  In  a good  conductor,  the  conduction  electrons  will  rearrange  their  charge 
distributions  so  as  almost  completely  to  shield  this  Coulomb  potential,  just  as  any 
electric  field  in  a metal  is  almost  completely  neutralized  by  the  conduction  electrons. 
Ordinarily  in  a metal,  then,  the  perturbative  potential  arising  from  the  self-consistent 
problem  is  confined  almost  entirely  to  the  perturbing  atom,  in  other  words,  we  have 
almost  precisely  the  case  which  we  have  been  taking  up  so  far  in  this  section,  leading 
to  discrete  levels  if  the  perturbative  term  V(0)  is  great  enough,  but  to  no  new  energy 
levels  if  it  is  below'  a critical  value.  We  shall  have  to  come  back  to  this  case,  then, 
and  its  implications  for  the  determination  of  the  self-consistent  field,  when  we  dis- 
cuss alloys,  the  effect  of  an  atom  of  one  type  substituted  in  a crystal  of  another  type. 

An  impurity  atoi  in  a metal,  as  we  have  just  stated,  has  its  field  almost  com- 
pletely neutralized  even  at  the  position  of  its  nearest  neighbors  by  the  conductivity, 
just  as  we  should  conclude  from  looking  at  the  electrical  properties  of  the  metal  from 
a macroscopic  point  of  view.  In  a semiconductor,  on  the  contrary,  the  conductivity 
is  not  great  enough  for  this  to  happen,  and  the  problem  is  much  more  like  the  electro- 
static one  of  a charge  in  a dielectric.  The  dielectric  constants  of  some  of  the  im- 
portant semiconductors  are  very  high,  that  of  germanium  being  of  the  order  of  mag- 
luiuue  of  16.  The  electric  field  of  a point  charge  in  such  a dielectric  will  be  less  than 
In  empty  space,  being  inversely  proportional  to  the  dielectric  constarit.  This  dielec- 
tric effect  of  course  is  itself  a manifestation  of  the  self-consistent  field.  If  we  look 
at  it  microscopically,  we  see  that  the  charge  polarizes  all  the  surrounding  atoms, 
displacing  their  electrons  with  respect  to  their  nuclei  so  as  to  produce  dipoles  on 
them,  and  it  is  well  known  tiiat  it  is  the  superposition  of  the  fields  of  these  dipoles 
which  cancels  most  of  the  field  of  the  polarizing  charge.  We  should  get  at  this  effect, 
in  a completely  logical  treatment  of  our  problem,  by  solving  for  the  wave  functions 
of  the  polarized  atoms  in  the  external  polarizing  field,  which  would  be  part  of  the 
self-consistent  field,  and  making  the  problem  self-consisteni,  which  would  im.ply  that 
each  atom  had  just  such  a dipole  moment  that  the  sums  of  ihese  dipole  fields  produced 
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the  dielectric  effect  by  simple  electrostatics.  Ordinarily,  however,  we  are  wiliing 
not  to  look  into  the  problem  in  such  a fundamental  way,  but  to  accept  the  value  of  the 
macroscopic  dielectric  constant  as  being  given,  and  to  assume  that  it  correctly  de- 
scribes the  self-coasistem  effect  of  the  polarizing  charge  in  the  dielectric. 

We  conclude,  then,  that  if  a dielectric  contains  an  impurity  atom  which  nor- 
mally has  an  excess  or  deficiency  of  charge,  the  potential  produced  by  this  atom  at 
a distance  will  be  that  produced  by  its  net  charge,  computed  for  the  actual  ditleciric 
constant.  That  is,  we  shall  have  a Coulomb  potential,  but  ordinarily  a good  deal  less 
in  magnitude  than  in  free  space,  on  account  of  the  large  dielectric  constant.  We  thus 
assume  that  if  it  is  legitimate  to  replace  the  difference  equation  by  the  differential 
equation,  we  shall  have  Schrodinger ‘s  equation  for  a hydrogen -like  problem,  but  with 
a potential  decreased  in  proportion  to  the  dielectric  constant,  and  an  effective  mass 
instead  of  the  real  mass.  The  solutions  will  then  be  the  ordinary  hydrogenic  solutions, 
but  as  we  have  stated  earlier,  the  wave  functions  will  be  much  more  spread  out,  and 
the  energies  much  less.  In  this  case  we  have  an  infinite  number  of  stationary  states, 
instead  of  a finite  number  as  with  the  perturbation  located  only  on  a iin^te  number  of 
atoms.  Furthermore,  we  do  not  have  any  critical  lo  ''“r  limit  for  the  magnitude  of  the 
perturbative  potentia.,  below  which  we  do  not  find  discrete  states;  that  is  a character- 
istic of  the  problem  where  the  perturbations  are  confined  to  a finite  number  of  atoms. 

We  also  find  something  in  the  Coulomb  case  which  we  do  not  in  the  case  of  a 

single  perturbing  atom:  the  differential  equation  forms  a good  approximation  to  the 

solution  of  the  difference  equation.  The  i eason  is  that  in  this  case  the  solution  of  the 

difference  equation  really  varies  slowdy  from  atom  to  atom,  and  we  have  seen  that  this 

is  the  condition  for  the  applicability  of  the  differential  equation  method.  Thus,  for  the 

ground  state  of  hydrogen,  the  wave  function  is  exp(-  yr),  which  remains  finite  at  r ~ 0, 

and  varies  slowly  from  atom  to  atom,  if  -yR  is  small  compared  to  unity,  where  R is 

the  interatomic  distance.  We  have  this  situation  in  the  cases  we  are  interested  in. 

This  is  in  striking  contrast  to  the  case  of  a single  perturbing  atom,  where  we  have 

seen  that  the  w'ave  function  is  of  the  form  thus  feel  that  the  conven- 

r 

tional  derivation  of  the  Impurity  levels  in  a semiconductor  on  the  basis  of  hydrogenic 
wave  functions  is  generally  legitim.ate. 

There  is,  however,  a feature  of  the  situation  which  has  generally  been  over- 
looked, and  which  can  be  very  important.  The  dielectric  effect  shields  the  iieid  of  the 
perturbing  atom  at  the  positions  of  all  neighboring  atoms,  but  it  does  not  affect  tiie 
perturbative  potential  on  the  perturbing  atom  itself.  In  other  words,  the  perturbation 
V(0),  the  average  of  the  perturbatvve  potential  over  the  Wannier  function  located  on  the 
impurity  atom,  can  be  just  as  great  in  a semiconductor  as  in  a metal.  This  is  larger 
than  the  value  which  would  be  consistent  with  the  hydrogenic  problem,  roughly  in  the 
ratio  of  the  dielectric  constant.  In  other  words,  the  Schrodinger  problem  which  is 
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really  appropriate  in  this  case  is  one  with  a Coulomb  potential  at  all  neighboring  atoms, 
with  a dielectric  tei-m  to  make  it  small,  but  in  aclduLoii  a potential  well  at  the  central 
atom.  We  may  now  legitimately  combine  the  t;>pes  of  arguments  which  we  have  used 
in  discussing  the  potential  well  problem,  and  the  Coulomb  problem.  We  may  assume 
that  unless  the  potential  well  becomes  too  deep,  exceeding  a critical  value,  the  wave 
function  and  energy  level  will  not  be  appreciably  affected  by  it.  In  other  words,  it  is 
likely  that  the  hydrogenic  wave  function  and  energy  level  corresponding  to  the  Coulomb 
potential  can,  so  to  speak,  resist  the  effect  of  a perturbation  on  the  central  atom,, 
provided  this  perturbation  is  not  too  large.  The  wave  function  will  be  modified  in  the 
immediate  neighborhood  of  the  central  atom,  but  it  extends  over  so  many  atoms  tha* 
this  m.odification  is  not  very  important,  and  will  not  change  the  energy  very  much.  If 
the  pert\irbation  of  the  cenU-al  atom  becomes  too  great,  however,  the  problem  will 
change  completely.  The  energy  level  will  fall  far  below  the  hydrogenic  value  (we  are 
assuming  that  the  perturbing  potential  energy  is  negative,  as  it  would  be  if  the  per- 
turbing ion  were  positively  charged,  or  as  in  the  case  of  a donor  atom  below  the  bot- 
tom of  a conduction  band).  At  the  same  time,  the  wave  function  will  become  much 
more  concentrated  around  the  perturbing  atom,  as  in  the  discrete  levels  in  the  prob- 
lem of  a single  perturbing  atom. 

This  situation  can  very  likely  occur  in  practice.  We  understand,  in  the  lii  st 
place,  on  the  basis  of  this  argument,  why  in  so  many  cases  we  seem  to  get  good  agree- 
ment, quantitative  as  well  as  qualitative,  between  the  simple  hydrogenic  theory  and 
observed  impurity  levels  in  semiconductors.  These  are  the  cases  in  which  the  per- 
turbative energy  on  the  perturbing  atom  is  le.ss  than  the  critical  value.  Different  per- 
turbative atoms,  under  these  circumstances,  would  have  alntost,  though  not  quite,  the 
same  wave  futictions  and  energy  levels,  the  discrepancies  coming  only  in  the  immedi- 
ate neighborhood  of  the  perturbing  atom.  On  the  other  hana,  we  should  also  expect 
cases  of  greater  perturbations,  in  which  the  whole  character  of  the  wave  functions 
and  energy  level  changes,  and  the  energy  level  is  much  further  below  the  conduction 
band  (or  much  further  above  the  valence  band,  if  we  are  dealing  with  acceptor  impuri- 
ties). This  may  well  be  the  explanation  of  the  deep  traps,  which  seem  to  occur  par- 
ticularly in  silicon.  Such  trapping  levels  seem  to  explain  the  long  time  constants  ob- 
served in  some  experiments  with  silicon,  and  similar  levels  have  been  postulated 

( 2) 

in  silicon  after  neutron  bombardment;  ' ' these,  of  course,  ,may  well  come  from  inter- 
stitial rather  than  substutional  impurity  atoms,  and  this  is  a problem  slightly  different 
from  those  we  have  discussed  so  far,  but  the  principle  is  not  different. 


^J.  R.  Haynes  and  J.  A.  Hornbeck,  Phys.  Rev.  90,  152  (L),  (l953). 
^W.  E.  Johnson  and  K.  Lark-Horovitz,  Phys.  Rev.  ?6,  4-iZ  (1949). 
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Finally,  we  should  mention  that  if  the  dielectric  constant  of  a semiconductor 
or  insulator  is  not  very  great,  the  Coulomb  type  of  problem  will  lead  to  a wave  function 
whi  'h  is  no  longer  extended  over  many  atoms,  but  is  more  concentrated.  In  such  n 
case  it  will  no  longer  be  justified  to  replace  the  difference  equation  by  the  differential 
equation,  and  wc  must  antitipate  quantitative  errors  in  the  use  of  the  hydrogcnic  solu- 
tion. We  hardly  expect,  however,  any  very  striking  qualitative  change  in  the  situa- 
tion. 

4.  Wannier  Functi ons  for  Overlapping  Bands 

In  our  discussion  of  the  preceding  section,  we  have  assumed  that  we  were 
dealing  v/ith  a non-degenerate  band  of  s-like  character,  and  that  the  effective  masses 
in  the  X,  y,  and  z direciion.s  were  identical.  In  many  actual  cases,  however,  notably 
in  germanium  and  silicon,  these  assumptions  are  not  correct.  We  have  degenerate 
or  overlapping  bands,  with  their  various  Wannier  functions.  In  our  discussions  of 
Chapter  1,  we  did  not  take  up  the  nature  of  the  W'annier  functions  in  such  a case, 
since  we  were  not  going  to  encounter  these  problems  in  that  chapter.  Now,  'nowever, 
these  questions  are  essential.  We  meet  a problem  having  two  aspects.  First,  what 
are  the  Wanniex-  functions  like  in  cases  of  degenerate  bands?  Secondly,  it  seems  al- 
most obvious  in  such  a case  that  we  must  use  the  Wannier  functions  of  the  various 
overlapping  bands  in  order  to  get  a proper  representation  of  the  impurity  wave  func- 
tion. We  must  then  look  into  the  question,  which  we  have  avoided  so  far,  of  building 
up  a solution  from  the  Wannier  functions  of  several  bands,  rather  than  of  one  band 
only.  The  way  in  which  this  will  enter  our  problem,  of  course,  will  be  that  the  per- 
turbative potential  will  have  non-diagonal  matrix  components  between  Wannier  func- 
tions of  different  bands.  We  shall  have  to  look  into  the  whole  question  of  the  non- 
diagonal  m.atrix  components  of  the  perturbative  energy,  a question  with  far-reaching 
consequences. 

Before  going  into  the  question  of  the  Wannier  functions  for  overlapping  bands, 
we  must  remind  ourselves  of  the  nature  of  overlapping  bands  themselves,  which  we 
have  discussed  in  Chaprer  1.  The  method  which  we  found  most  informing  in  a quali- 
tative way  for  discussing  this  pi'oblem  was  the  simplified  tight -binding  approximation, 
which  wc  took  up  in  Sections  9 - 12  of  Chapter  1.  Let  us  use  this  method  in  a first 
survey  of  the  situation.  We  remember  that  the  general  method  starts  by  setting  up 
Bloch  sums  of  a number  of  atomic  orbitals,  which  can  be  of  s,  p,  d,  etc.  , type.  V^e 
then  calculate  the  matrix  components  of  the  one-electron  Hamiltonian  between  these 
Bloch  sums,  making  many  simplifications  such  as  disregarding  overlap  effects,  three - 
center  integrals,  and  l elated  quantities.  We  solve  the  secular  equation  between  tlie 
various  unperturbed  wave  functions,  to  get  the  energy  levels,  and  the  appropriate 
combinations  of  unperturbed  wave  functions  giving  the  best  approximations  to  solutions 
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of  Schrodinger's  equation. 

Let  Ua  assume  that  our  original  atomic  orbitals  are  so  that  an  atomic 

orbital  located  on  an  in  the  cell  at  displacement  R.  from  the  origin  is  4>^(r  - R.). 

Let  as  assume  that  we  have  solved  the  secular  equation  last  described,  so  that  the 
wave  function  corresponding  to  the  propagation  vector  k can  be  written 


m,  1 


K.; 


f (k ) exp(ik 
mn  ’ ^ 


4)  (r  - R ). 

1' 


(2.  39) 


This,  as  we  see,  is  a linear  combination  of  Bloch  sums  formed  from  the  various 
aiv'jinic  orbitals  <t>^;  the  coefficients,  i^^{k  ),  depend  on  the  propagation  veiTor  k,  as 
we  indicate  explicitly.  We  have  spoken  as  if  this  determination  of  the  wave  functions 
depended  on  our  simplified  tight-binding  approximation,  but  we  note  that  tne  exact 
solution  of  a tight -binding  method,  taking  account  of  all  refinements,  would  still  nave 
the  form  (2.  39),  thc'Ugh  the  coefficients  f^^_^(k)  would  have  somewhat  different  numeri- 
cal values, 

A question  can  well  arise  concerning  the  assignment  of  indices  m to  the  vari- 
ous energy  bands.  If  we  have  a certain  number  of  atomic  orbitals,  then  the  secular 
equation  will  yield  an  equal  number  of  functions  j^r  ).  In  general,  however,  there 
will  be  no  one-to-one  correspondence  between  one  of  the  final  energy  bands,  and  one 
type  of  atomic  orbital.  The  orJy  thing  which  we  can  do  to  name  the  bards,  or  to  give 
them  the  indices  m appearing  in  tne  notation  j^(r  ) for  the  wave  functions,  is  to 
number  them  up  from  the  lowest  energy  band.  We  cannot  even  identify  a single  band 
as  an  s,  or  a p,  or  a d band:  in  general,  we  shall  have  mixtures  of  all  types  of  atomic 
orbitals,  in  each  band,  though  in  some  cases,  as  in  the  tightly  bound  states,  we  shall 
have  a band  whose  wave  functions  are  composed  almost  entirely  of  one  type.  The 
Wannier  functions  associated  with  one  band,  then,  will  be  combinations  of  all  the  dif- 
ferent types  of  atomic  orbitals.  Sometimes  we  have  a single  band,  widely  separated 
from  others,  so  iliat  the  wave  functions  of  that  band  are  composed  almost  entirely  of 
one  type  of  atomic  orbitail.  Then  the  Wannier  fu*iction  will  also  be  composed  almost 
emirely  of  that  type  of  orbital;  but  with  overlapping  bands,  like  the  p and  d bands, 
this  will  no  longer  be  the  case,  and  in  a p band  each  Wannier  function  will  be  made 

up  in  a complicated  way  from  the  p , p , p atomic  orbitals. 

X y 

There  is  one  very  important  feature  of  the  bands,  and  their  rt;lalion  to  the 
Wannier  functions:  two  bands,  as  we  have  defined  them,  ca.n  never  be  degenerate 
with  each  other.  This  is  a loose  way  of  describing  the  following  situation.  Except 
at  certain  symmetry  points  in  the  Brillouin  zone,  all  the  energy  levels  of  the  systen. 
will  be  separated  from  each  other;  there  will  be  no  oegenerale  states  for  an  arbitrary 
k value.  If  then  we  associate  all  the  lowest  energy  levels,  for  all  k values,  to  form 
the  lowest  energy  band,  ail  the  next  lowest  le'.’cls  to  form  the  second  band,  and  so  on. 
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ther.P  bands  will  all  be  distinct  from  each  other.  The  average  energy  of  eac  n v<ill  lie 

higher  tnan  the  average  energy  of  the  next  band  below  it.  It  is  in  this  sense  that  there 

is  no  degeneracy.  Out  there  is  an  immediate  consequence  of  this;  there  cannot  be  any 

degeneracy  in  the  Wannier  functions,  in  the  sense  that  the  Wannier  function  oi  one 

band  cannot  transform  into  the  Wannier  function  of  another  band  by  one  of  the  symmetry 

operations  of  the  crystal.  We  cannot  have  Wannier  functions,  for  instance,  which 

have  the  symmetries  of  p , p , and  p aiomic  functions,  for  in  such  a case  one  of  them 

X y z 

would  transform  into  another  by  one  of  the  symme'ry  operations  of  the  crystal.  Rather, 
in  the  p case,  we  have  three  quite  distinct  Wannier  functions,  one  corresponding  to 
the  lowest  p band,  the  second  to  the  next  lowest,  the  third  to  the  highest  The  diagonal 
matrix  component.s  of  energy  of  these  three  Wannier  functions  (tlie  quantities  which  we 
have  been  denoting  as  r?(0))  will  thus  be  different  for  the  three,  and  so  will  the  other 
matrix  components.  And  each  of  these  three  Wannier  functions  must  be  built  up  in 

some  sort  of  symmetrical  way  out  of  p , p , and  p atomic  orbitals. 

X y z 

Crystal  symmetiy  gives  us  quite  definite  inforrciation  about  the  Wannier  func- 
tions, even  though  it  does  not  lead  to  degeneracy.  It  is  of  course  clear  that  a Wannier 
function  must  be  transformed  into  other  Wannier  functions  by  any  operation  of  the 
group  of  crystal  symmetry  operations;  but  we  have  jusl  seevi  tiiat  there  can  be  no  de- 
generacy, or  that  really  a Wannier  function  cannot  be  transformed  into  a distinct  Wan- 
nier function  by  one  of  these  operations.  The  only  alternative  is  that  it  should  be 
transformed  into  a multiple  of  itself.  Thus  we  arrive  at  the  important  fact  that  a Wan- 
nier function  must  belong  to  a one-dimensional  irreducible  representation  of  the  sym- 
metry' group  of  the  crystal.  For  instance,  in  the  cubic  group,  we  remember  that  there 
are  four  one -dimensional  representations,  which  we  cam  describe  easily  ti.3  lOxiOWS! 
the  first  transforms  into  itself  under  all  operations  of  the  group;  the  second  changes 
sign  on  reflection  in  the  planes  x = 0,  y - 0,  and  z ~ 0,  but  not  or:  reflection  in  the 
planes  x = ty,  y = tz,  z=tx;  the  third  changes  sign  on  reflection  in  the  planes 
X = t y,  etc.  , but  not  in  x = 0,  etc;  and  the  fourth  changes  sign  on  reflection  in  each 
set  of  planes.  There  are  correspondingly  four  one-dim.ensional  representations  in 
the  group  of  the  square.  We  now  see  that  each  Wannier  function,  no  matter  whether 
made  up  of  s,  p,  d,  or  what  type  of  atomic  orbitals,  rniust  transform  according  to  one 
of  these  very  limited  types  of  symmetry. 

It  is  somewhat  puzzling  at  first  sight  to  see  how  the  atomic  orbitals  are  com- 
bined to  give  Wannier  functions  cf  quite  different  sy'mmetry  type.  We  can  understand 
it  clearly,  however,  if  we  examine  the  detailed  way  in  which  the  Wannier  functions 
are  set  up.  In  Eq.  (l.  25),  Chapter  1,  we  find  that  the  Wannier  function  - R^)  can 

be  written  in  terms  of  the  energy  band  wave  function,  which  we  are  her»>  writing 
"m  equation 
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i-f . » 


1/2 


Z(^)  .--(O 


exp(-  ik 


(2.  40) 


Hen;  the  siirnmation  over  k is  to  be  carried  over  the  M c.istnict  k values  in  the  unit 

cell  in  k space.  We  can  then  substitute  for  B r-(r  } from  (2.  39).  In  doiny  this,  it  is 

rn , K 

well  to  take  advantage  of  the  fa<'d  that  the  coefficient  ) must  be  a periodic  function 

m k space;  we  Knov/  this,  since  both  the  exponential  functions  exp(k  • R.)  and  the 
Bloch  functions  ^ pei  iodic  functions.  Tiius  we  can  expand  the  f's  in  ..he 

form 


f (iT ) = T{R.)  F (R.)  exo(ik  ■ R,). 
■nn'  ' ^ r mn'  r • ' r 


(2.  41) 


If  we  now  substitute  (2.  39)  and  (2.  41)  in  (2.  lO),  we  have 


a (r 
m 


R ) = N"  y(k,  n,  R.,  R,)  F (K.)  exp(ik 
j » * I*  I'  xnn  r 


(R.  - R,  + R^  - 


RD.  U.4^) 


We  may  now  consider  separately  the  sum  over  k in  (2.  42).  We  remember  that  sucli 


a sum  is  zero,  unless  the  quantity  R.  - R.  + R = 0. 


hat  case,  since  we  are  sum- 


i j -•£ 

ming  over  N lattice  points,  it  is  N.  Thus  we  ran  reduce  the  expression  (2.42)  to 


1 / ->  _ 


a (r  - R.)  = N*'*'  ) (n,  RJ  F (R.)  4.  (r  - R.  + R,). 
m j'  ’ t'  mn'  r j r 


(2.43) 


Eq.  (2.  43)  is  a very  important  result  sliowing  how  the  Wannier  funcUon  a (r  - R 
th  -►  J 

for  the  m band  and  the  unit  cell  at  R , is  written  in  terms  of  atom.ir  orbitals  <()  (r  - 

-►-►  th  -► 

Rj  + R|).  of  tile  n type,  located  in  cells  displaced  by  R^  from  the  cell  where  we  are 
finding  the  Wannier  function.  It  shows  us  at  once  that  the  way  to  build  up  the  VVannier 
function  of  a band  arising  from  degenerate  atomic  orbitals  is  to  superpose  atomic  or- 
bitals, not  only  on  the  atom  where  we  are  finding  the  Wannier  function,  hut  on  neighbor- 
ing atoms  as  well;  and  it  shows  us  that  the  way  to  find  the  coefficients  is  to  get  the 

Fourier  coefficients  F (RJ  of  the  transformation  coefficients  f (k  ) transforming 

mn  I mn  ^ 

the  original  atomic  orbitals  to  the  final  Bloch  functions. 

We  can  now  e.asily  work  out  special  cases,  to  see  the  sort  of  superposition  of 

atomic  orbitals  actually  arising  in  cases  met  in  practice.  Thus  the  two-dimensional 

p degeneracy  in  the  square  lattice  has  been  discussed  in  Chapter  1,  pp.  46-47.  We 

can  divide  the  square  Brillnuin  zone  into  four  regions,  by  the  diagonal  lines  k = t k . 

y X 

in  one  of  these  regions,  is  positive  and  greater  numerically  than  k^;  in  the  second, 

k^  is  positive  and  greater  nu.merically  than  k^;  in  the  third,  k^  is  negative  and  greater 

numerically  k ; and  in  the  fourth  k is  negative  and  greater  numerically  than  k . In 

y y X 

the  first  and  third  of  these  reeions  the  lower  band  corresponds  to  a linear  combination 


\ 
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Fig.  6 

Wannier  function  foi'  lower  (a)  anu  upper  (b)  bands  arising  from  degeneracy 
of  two  p states  in  two-dimensional  square  lattice. 


which  is  composed  almost  entirely  of  p^,  hardly  at  all  of  p.  , while  in  the  sec(md  and 

X T 

fourtii  the  lower  band  is  almost  entirely  p hardly  at  all  p . 1 hus  the  function  f for 

the  coefficient  of  the  p^  function  in  the  expansion  of  the  lower  energy  band  is  almost 
unity  (or  a constant,  cui  l ect  to  produce  normalization!  in  the  first  and  third  regions, 
almost  zero  in  the  second  and  fourth.  Examination  of  the  symmetry  of  the  Wannier 
function  shows  that  to  obtain  a function  whose  symmetry  belongs  to  one  of  the  irre- 
ducible representations  of  the  square  group,  the  function  f must  as  a matter  of  fact 
have  opposite  signs  in  the  first  and  third  regions  mentioned  above,  and  to  get  a real 
Wannier  function  it  must  be  pure  imaginary.  We  can  then  approximate  the  pro’oieiu 
setting  up  this  f equal  to  i in  region  i,  - i in  region  3,  and  zero  in  regions  2 and  4. 
Similar  situations  are  encountered  for  the  otner  Fs. 

The  writer^  has  shown  how  to  expand  these  approximate  f's  in  Fourier  series 
of  the  form  (2.41),  and  the  corresponding  expansions  of  the  Wannier  functions,  of  the 
form  (2.  43),  were  arrived  at.  The  results,  for  the  lower  and  upper  p bands,  were 
presented  graphically,  in  the  form  shown  in  Fig.  6.  Here  the  circles  represent  atoms; 
the  Wannier  function  is  being  computed  for  the  central  atom  in  each  figur.i.  They  con- 
sist of  p-like  unperturbed  atomic  functions  on  adjacent  atoms,  pointing  in  the  direction 
of  the  arrows,  and  with  amplitudes  indicated  schematically  by  the  lengths  of  the  ar- 
rows. There  is  no  contribution  in  either  case  on  the  central  atom  itself.  We  can  now 


I _ , ^ 

■J.  C.  blater,  Fh^'s.  Rev.  87,  807  (1952);  see  particularly  pp.  628-833. 
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sec  clearly  from  Fig.  6 how  it  can  be  that  the  p-iike  atomic  ui  bitals  can  r;ombine  to 
give  a Wannier  function  whose  symmetry  properties  are  quite  different  from,  those  of 
the  individual  orbitals.  la  the  rase  (a),  it  is  clear  that  the  Wannier  function  belongs 
to  the  representation  of  the  square  group  which  transforms  into  itself  under  each  op- 
eration of  the  group.  In  case  (b),  on  the  contrary,  the  Wannier  function  belongs  tc 
the  representation  which  changes  sign  on  reflection  in  the  lines  x ' 0 and  y - 0,  and 
alee  on  reflection  in  the  lines  x = t.  y.  These,  we  remember,  are  two  out  of  the  four 
possible  symmetry  types  for  Wannier'  functions  in  the  square  lattice. 

One  inLer'e.stli'ip'  feature  of  the  W'annier  functions  i.s  observed  in  Fig.  6;  tliere 
is  no  contribution  'tn  the  Warmiei'  function  from  atomic  orbitals  on  the  central  atom, 
where  the  Wannier  function  is  centered.  This  must  clearly  always  be  tne  case  if  xhe 
symmetry  of  the  Wannier  function  is  different  from  the  symmetry  type  of  the  atomic 
orbitals  of  which  it  is  composed.  The  case  of  an  s-like  atom’c  orbital  is  one  of  the 
few  cases  in  which  an  atomic  orbital  possesses  the  same  symmeU-y  type  as  the  Wan- 
nier  function;  in  p and  d atomic  orbitals  each  of  the  orbitals  belongs  to  a two-  or 
three-dimensional  representation  of  the  cubic  point  group,  and  since  the  Wannier 
functions  must  belong  to  one  dimensional  representations,  we  cannot  have  any  con- 
irib'uiions  on  the  central  atom  when  we  are  dealing  with  Wannier  functions  derived 
from  p or  u orbitals.  Thi-s  has  the  result,  of  course,  that  the  V/annier  functions 
are  concentrated  on  .atoms  distant  from  the  central  one  on  which  they  are  formally 
located,  so  that  they  really  are  much  more  extended  in  space  than  has  been  generally 
assumed.  The  reason  for  this  general  misconception  is  that  almost  all  the  oiscussions 
of  Wannier  functions  have  disregarded  orbital  degeneracy,  and  ha-.e  been  carried  cut 
as  if  we  were  dealing  with  s orbitals. 

The  extended  nature  of  Uie  Wannier  functions  means  that  there  are  rnairix 
components  of  energy  between  much  more  distant  neighbors  than  we  should  have  if 
they  were  more  like  single  atomic  orbitals.  We  can  see  this  also  from  Eqs.  (2.  2) 
and  (?. . 4).  These  show  us  that  the  matrix  component  of  energy  between  two  Wannier 
functions  displaced  by  a vector  R.  with  respect  to  each  other  equals  the  corresponding 
Fourier  component  in  the  Fourier  expansion  of  the  energy  E(k ) of  the  energy  band,  as 
a function  of  k.  Now  this  energy  E(k  ) is  found  as  the  eigenval'ue  of  the  secular  prob- 
lem of  making  linear  combinations  of  atomic  orbitals  to  set  up  the  Bloch  functions. 

In  such  a case  as  that  of  th  two-dimensional  square  problem  of  p degeneracy,  which 
we  have  mentioned  earlier,  the  energy  as  a function  of  k can  be  quite  complicated, 
consisting  approximately  of  the  energy  of  the  Bloch  function  formed  from  p^  orbitais 
over  part  of  the  Brillouin  zone,  approxim.ately  the  energy  of  ihe  function  formed  from 
p over  the  r est.  At  the  center  of  the  Brillouin  zone,  as  we  saw  in  Section  7,  Chapter 
1,  the  energy  has  a singular  point,  and  cannot  be  expanded  in  power  series;  its  second 
derivative  is  discontinuous  at  that  point.  We  must  expect,  then,  that  the  Fourier 
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expansion  of  the  energy  will  converge  very  slowly.  Tins  means  that  there  will  be 
components  (R^)  connected  with  quite  large  values  of  R^;  in  other  words,  the  inter- 
actions with  quite  distant  neighbors  are  important.  This  situation,  then,  is  quite  dif- 
ferent from  that  with  s-like  Wannier  functions,  where  it  can  be  quite  a good  approxi- 
mation to  considei-  only  interactions  with  perhaps  nearest-  and  second-nearest  neigh- 
bors. "^I  his  can  lead  to  considerable  increase  in  the  difficulty  of  handling  the  differ- 
ence equation  solution  lor  impurity  levels;  it  can  also  make  it  much  less  legitimate 
than  in  tiie  simpler  cases  to  replace  the  difference  equation  by  a differential  equation. 

We  miay  well  be  disconcerted  to  find  that  we  do  not  have  Wannier  func’  jons 
transforming  accoiding  to  the  same  symmetry  behavior  as  the  atomic  orbitals.  It  is, 
of  course,  perfectly  possible  to  have  impurity  wave  functions  transforming  with  the 
same  symmetry  properties  as  atomic  orbitals,  and  we  might  well  wish  to  have  Wan- 
nier  functions  transforming  in  the  same  wav.  I he  reason  why  we  do  not  have  ti'is  is 
the  convention  which  we  have  made  regarding  the  definition  of  the  bands.  Foster''^ 
has  shown  that  it  is  possible  to  define  the  hands  differently,  in  such  a w.ay  as  to  ob- 
tain Wannier  functions  having  the  same  transformation  properties  as  the  atomic  or- 
bitals. Tnus,  for  instance,  in  the  tv/o-dimensional  so,uare  lattice  case  of  the  p bands, 
which  we  have  been  using  as  an  illustration,  we  can  proceed  as  follows.  We  remem,- 
her  our  four  regions  in  the  Briliouin  zone,  1,  2,  3,  and  4,  such  '.hat  in  region  1, 
is  positive  and  its  magnitude  is  greater  than  that  of  k^,  and  so  on.  Let  us  assume  that 
we  know  the  energy  of  each  of  the  two  bands  precisely  in  our  Bril' ouin  zo.ne.  Now  let 
us  define  a new  band  as  follows;  in  region  2,  it  will  consist  .of  the  lower  of  the  two 
energy  levels;  in  2,  of  the  higher;  in  3,  of  the  lowei-;  and  in  4,  of  the  higher.  Along 
the;  lines  k = t k the  energy  will  be  discontinuous.  From  this  band,  let  us  define 
a Wannier  function  in  the  usual  way:  it  will  then  prove  to  have  the  symmetry  behavior 
of  a p^  orbital.  In  fact,  if  the  non-diagonal  matrix  component  of  energy  between  the 

Bloch  functions  formed  from  p and  p orbitals  is  vanishingly  small,  this  Wannier 

X y 

function  will  be  precisely  the  p^  function.  Similarly  we  can  define  a second  band,  con- 
sisting of  the  higher  of  the  two  energy  bands  in  regions  1 and  3,  the  lower  in  2 and  4; 
the  Wannier  function  defined  from  it  will  behave  like  a orbital.  Under  the  action 
of  the  symmetry  operations  of  the  group,  these  two  Wannier  functions  will  tra»T?form 
like  a two-dimensionai  irreducible  representation  of  the  transformation  group  of  the 
square. 

This  shows  us,  ihen,  that  we  ca.n  introduce  Warunier  functions  in  various  dif- 
ferent ways,  though  only  the  way  we  first  described  is  consistent  with  defining  energy 
bands  in  which  the  enex  gy  is  a continuc>u.s  function  of  k.  The  new  Wannier  functions 
must,  of  course,  be  linear  combinations  of  Uie  old  ones.  We  can  it  we  choose  build 
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up  the  solution  of  the  problem  of  the  enei  level  sround  an  impurity  atom  as  a linear 
combinatinn  of  these  new  Wanniei  functions,  rather  than  of  the  original  ones.  But 
once  we  rea!’:?e  this,  we  ?;ee  that  we  can  really  accomplish  the  same  thing,  in  the 
tight  binding  approximation,  in  a much  more  straightforward  way.  Our  Wannier  func- 
tions  of  the  original  sort  are  linear  combinations  of  the  atomic  orbitals.  By  making 
linear  cornbiralions  of  these  Wannier  functions,  we  secure  other  Wannier  functions 
having  the  same  symmetry  as  ihc  atomic  orbitals.  But  now  we  .see  that  we  could  have 
saved  aii  Uus  work.  Any  linear  combination  of  all  the  Wannier  functions  formed  from 
a set  of  atomic  orbitals  can  equally  well  be  wi  ilLea  uii  ectly'  as  a linear  combina- 
tion of  the  original  themselves.  As  far  as  the  simplified  tight  binding  approxi- 
mation is  concernea,  we  can  treat  the  themselves  as  the  starting  point  of  the 

caicjlalion  of  irripui  lLy  ieveLs,  and  as  we  shall  sec  immediately,  the  same  matrix 
componenis  of  enci'gy  Lelween  these  atomic  orbitals  which  we  assume  tf>  gei  a correct 
representation  of  the  energy  bands  can  also  be  used  to  dic.cuss  impurity  levels.  This, 
then,  forms  a very  straightforward  method  of  discussing  such  levels,  which  we  shall 
consider  next. 

5.  Impurity  L^els  with  the  Tight  Binding  Approximation 

In  the  preceding  section  we  have  shown  that  if  we  start  wdth  atomic  orbitals, 
apply  the  simplified  tight  binding  apprtjximalLOii,  build  up  energy  bands  and  Bloch 
wave  functions  in  this  way,  solving  the  secular  equation  so  a.e  to  diagonalize  the  en- 
ergy, and  then  derive  Wannier  functions  from  '^he  resulting  energy  band  functions, 
the  final  Wannier  functions  are  linear  combinations  of  the  original  atomic  orbitais, 
located  on  many  different  atoms.  Our  next  step  in  building  up  a solution  of  the  per- 
turbed periodic  potential  problem  would  oe  to  expand  the  solution  as  a linear  com- 
bination of  the  Wannier  functions  on  the  various  atoms.  We  are  now  concerned  with 
the  case  where  a number  of  bands  are  close  enough  together  so  that  the  Wannier  func- 
tions of  all  these  bands  must  be  used  to  get  a good  representation  of  the  final  wave 
function.  In  this  case,  as  we  have  just  pointed  out  at  the  close  of  the  preceding  sec- 
tion, it  is  an  entirel.v  unnecessary  complication  to  introduce  the  Wannier  functions  at 
all.  We  can  get  just  the  sa.me  approximation  by  expanding  our  solution  directly  in 
terms  of  the  original  atomic  orbitals.  We  shall  now  show  that  this  procedure  can  be 
formulated  very  conveniently.  It  seems  likely  that,  in  the  present  state  of  develop- 
ment of  the  theory  of  energy  bands,  it  is  the  most  usable  method  for  handling  the 
energy  levels  of  impurity  atoms  with  overlapping  bands. 

lii  Eq.  (59),  Chapter  1,  'v<=  set  up  an  equation  for  the  matrix  components  of 
the  periodic  potential  function  between  two  atomic  orbitals.  If  i.5  the  periodic 
poteatial,  the  matrix  component  between  orbitais  and  - Rj).  where  n and 

m refer  to  two  different  types  of  orbitals,  such  as  an  s and  a orbital,  and  where 
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IL  is  a lattice  displacement,  was  written 


DX-)  V^(r) 


6 (r  - 


R ) - 


^ i'  i . 

' nm ' 1 ' 


(2.  44) 


We  called  the  diagonal  component  of  the  Hamiltonian,  when  n = ni,  = 0,  by  the 

symbol  E . bn1  there  is  no  reason  v/hy  we  cannot  use  the  symbol  ^ (O)  for  this 
quantity.  We  disregarded  all  lack  of  orthogonality  between  the  4>’s  on  different  atoms. 
We  set  up  in  Table  1,  Ch.apter  1,  miethods  of  reducing  these  quantities  to 

simpler  axpressions  in  terms  of  two-center  integrals.  Finally,  we  used  t.hese  ex- 
pressions as  disposable  parameters,  to  be  determined  in  such  a way  that  the  energy 
bands  derived  from  the  method  shonlH  fit  the  results  of  more  accurate  calculation, 
as  for  instance  by  the  orthogimalized  plane  wave  method. 

We  w’cre  interested  iri  Cnapter  i in  solving  the  periodic  potential  problem, 
without  perturbations.  We  did  this,  though  we  did  not  wi  iie  the  steps  down  explicitly, 
in  the  following  way.  We  assumed  that  the  wave  function  could  be  written  in  the  form 
of  a sum 


4i  = ^(n,  R.)  ’dp(K.)  <t>p(r' - S.),  (i- 45) 

where  the  U^'s  are  coefficients  to  be  determined.  Then  we  wrote  the  Schrodinger 
equation  for  this  wave  function  4<.  and  set  up  the  corresponding  secular  equation  for 
the  coefficients.  This  secular  equation,  making  the  assumptions  (2.  44)  regarding  the 
matrix  components  of  energy,  was 

^(m,R)  (R  - R.)  U (R.)  = EU  (R.),  (2.46) 

' ' nm-  i j'  m j'  n i 

We  then  assumed  that  the  coefficients  U ) could  be  written  in  the  Bloch  form 
exp(ik  • S.).  If  we  make  this  substitution  in  (2.  45),  we  have  a wave  function  of  the 
form  already  written  in  Eq.  (2.  39).  1 he  secular  equation  (2.  46)  becomes  simplified, 

in  that  we  find  that  our  assumption  automatically  satisfies  the  equation  for  all  values 
of  R^,  provided  only  we  satisfy  the  secular  equation 

y(m)  c ) ^ (R  ) exp(-ik  • R.)]  = E c . (2.47) 

^ ' m 1 nm  j'  ^ j J 

In  other  words,  the  c^'s  are  the  amplitudes  of  various  Bloch  waves,  and  the  matrix 
compoueiiia  uf  ciitigy  between  these  Bloch  waves  are  as  given  in  the  square  bracket 
of  (2.47),  agreeing  with  Eq.  (60)  of  Chapter  1.  vi  e sa'w  in  Chapter  1 how  cltcn  a rela- 
tively small  secular  equation  of  tVie  form  of  the  form  of  (2.  47)  could  give  satisfactory 
results.  Thus,  in  the  diamond  structure,  we  used  eight  atomic  orbitals,  and  s and 
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a p , p , p on  each  of  the  two  types  of  atoms  in  the  unit  cell,  so  that  we  had  an 
A y — 

eight-by -eij^ht  secular  equation,  whose  eigenvalues  E gave  the  energy  bands  described 
in  Chapter  1 . 

We  noticf'  that  Ecj.  (2.46)  is  very  much  like  Eq.  (Z.5),  in  which  we  set  up  the 

difference  equations  for  the  Wannier  functions.  The  only  difference  (aside  from  the 

fact  that  we  have  not  yet  introduced  a perturbative  potential  like  the  of  Eq.  (2.  5)) 

is  that  wc  now  have  non-diaconal  matrix  coin'^o'^e’^ts  IR  - R 1 b«>tweMn  <h's  with 

^ nm'"i  j' 

different  subscripts  n and  m,  whereas  in  (2.  5')  there  are  no  mati-ix  components  be- 
tween Wannier  functions  in  different  bands.  This  distinction  is  fundamental;  it  is  by 
introducing  such  non-diagonal  matrix  components  between  different  atomic  orbitals 
that  we  were  able  in  Chapter  1 to  set  up  the  secular  equation  which  gave  the  whole 
behavior  of  the  energy  bands.  V/e  can  get  rid  of  these  non-diagonal  matrix  components 
only  by  solving  (2.47)  for  the  energy  band  wave  functions,  defining  Wannier  functions 
from  these,  and  setting  up  difference  equations  in  terms  of  the  Wannier  functions. 

Wc  gain  one  thing  by  doing  this:  we  have  no  non-diagonal  matrix  components  of  en- 
ergy betwemn  Wannier  functions  in  different  bands.  Bui  we  lose  something:  we  have 

the  extended  type  of  Wannier  function  discussed  in  the  preceding  section,  with  non- 
diagonal  matrix  components  of  energy  between  Wannier  function.s  in  the  same  band  on 
quite  distant  atoms.  In  the  present  section  we  are  exploring  the  advantages  of  using 
the  original  atomic  orbitals,  in  which  we  must  consider  several  different  energy 
bands,  but  in  which  we  can  assume  that  the  non-diagonal  matrix  components  ~ 

Kj)  fail  off  quite  rapidly  as  the  atoms  get  further  apart.  For  most  purposes,  it 
seems  that  the  advantages  of  this  method  outweigh  the  disadvantages. 

Let  us  now  supplement  Eq.  (2.  46)  by  inserting  perturbation  terms,  like  those 
in  Eq.  (2.  5).  We  can  write  them  in  the  same  form,  as  there.  That,  is,  we  have 


y;( 


m 


• k^) 


U (R.) 

m y 


EU  (R  ). 
n'  i' 


{?..  481 


Except  for  the  one  fact  that  our  quantity  has  com.ponents  if  n f .m,  this 

equation  is  identical  with  (2.  5).  We  must  remember,  of  course,  that  the  rigorous 
tight  binding  calculation  would  be  much  more  complicated  than  this.  We  should  have 
to  include  terms  coming  from  lack  of  orthogonality  of  atomic  orbitals  on  different 
atoms,  from  three-center  integrals,  and  so  on.  But  we  have  already  seen  the  use- 
fulness of  the  present  approximations  in  the  case  of  finding  energy  bands  of  unper- 
turbed lattices.  The  advantage  of  the  present  scheme  is  that  we  can  now  use  the 
same  for  both  the  unperturbed  band  and  the  problem  of  the  perturbed  lattice, 

so  that  the  perturbed  energy  levels  will  be  consistent  with  a correct  solution,  to  the 
same  accuracy  to  which  we  have  been  able  to  fit  the  energy  bands  to  a correct  solu- 
tio.n.  The  of  course,  represent  matrix  components  of  a perturbative  potential, 
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such  as  that  produced  by  an  impurvty  atom,  between  different  atomic  orbitals;  and  Just 
as  with  the  's,  we  might  well  v/ant  to  treat  these  as  adiustaDie  Maiameters,  to 

approximate  m.ore  accurate  solutiwns  dttlermined  by  other  methods,  rather  than  C jm- 
puting  them  from  actual  atomic  orbitals. 

Now  that  we  tiave  set  up  Eq.  (2.  4S),  and  understand  its  implications,  we  can 
at  once  ansv.'er  the  question,  how  many  atomic  orbitals,  and  therefore  how  many  energy 
bands,  must  we  take  into  account  in  setting  up  the  energy  band  problem  properly.  Thus, 
for  instan(^e,  in  diamond  or  silicon  or  germanium,  if  we  have  perturbed  energy  levels 
near  the  conduction  or  vslence  bands,  we  shall  want  to  use  the  atomic  s and  p orbitals 
on  atoms  of  both  types  in  the  unit  ceil,  or  eight  orbitals,  just  as  in  discussing  the  en- 
ergy bands;  and  v/c  canno.  expect  to  obtain  a correct  picture  of  the  perturbed  levels  in 
any  simpler  way.  Similarly,  in  a simple  cubic  ciyslal  in  which  the  p levels  are  well 

separated  fr  om  s and  d bands,  we  should  have  to  use  the  p , p , p orbitals  as  a 

X z 

minimuiii.  Ordinarily,  in  practice,  the  s levels  lie  so  close  to  iiie  p that  we  cannot 
do  with  less  than  an  s orbit, ai  as  well  as  the  p's 

6.  General  Method  for  Solving  the  Difference  Equations 

In  the  preceding  sections,  we  have  seen  that  our  general  problem  of  finding 
tile  energy  levels  in  n perturbed  per-iodic  ia'.iLce  can  be  reduced  to  the  solution  of  cex  - 
taui  difference  equal  ions:  Eq.  {2.  5)  if  we  are  formulating  the  problem  in  terms  of 
V^annier  functior'.  (2.  48)  if  we  are  using  the  tight  binding  approximation.  We  have 
solved  these  diffei  nee  equations  in  a number  of  special  cases:  the  one -dimensional 
chain  with  neare.st  and  second -nearest  neighbor  interactions  in  Section  2,  the  three- 
dimensional  simple  cubic  lattice  with  nearest-neighbor  interactions  in  Section  3.  Our 
methods  have  been  ratiier  complicated,  and  applicable  only  to  special  cases.  For- 
tunately, however,  a far  more  general  method  of  solving  these  difference  equations 
exists,  reducing  to  the  values  previously  obtained,  but  capable  of  being  applied  to  a 
far  wider  range  of  problems.  This  method  is  now  being  worked  out  by  the  writer  and 
Dr.  G.  F.  Koster,  and  was  not  available  at  the  time  the  earlier  sections  of  this  chap- 
ter were  written.  Its  general  outline  will  be  presented  here,  though  its  applications 
have  not  yet  been  carried  further  than  the  exs.mples  alr.=;ady  discussed. 

This  method  is  based  on  a suggestion  of  Dr.  Koster,  to  expand  the  perturbed 
function,  not  in  terms  of  Wannier  functions  directly,  but  in  terms  of  Bloch  functions. 

We  shall  formulate  U first  for  the  simplest  case,  of  a single  non-overlapping  band 
with  a perturbation  only  at  the  atom  at  the  origin,  but  then  shall  extend  it  to  the  general 
case.  Let  us  use  periodic  boundary  conditions,  with  N atoms  in  the  repeating  region. 
V.^'o  have  Wannier  functions  a(r  - R.),  for  the  banr)  in  nuestion.  Then  the  corresponding 
Bloch  functions  are 
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Br 


(r)  = 


a'vi 


- R.) 


(2. 49) 


These  functions  are  normaiized  and  orthogonal,  and  there  are  N ot  them,  correspond- 
ing to  In  k values,  as  shown  in  Chapter  !.  The  matrix  coniponcnl  oi'  the  perturbative 
energy  betv.een  two  '.f  these  Bloch  functions  will  be 


In"  ‘ 7(R  , li  } V(R.,  K ) . 

^ 1 ,!  1 J 


(2.  90) 


where  V(H^,  Ky  is  ’he  matrix  component  of  the-  perturbation  betw-een  the  corresponding 
Wannier  function'-.  The  special  case  which  we  shall  take  up  first  is  that  in  which 
V(R  , R ) is  zero  unless  both  R.  and  R are  zero,  in  which  case  it  is  V , as  in  previous 
applications.  Then  the  matrix  component  (2.  50),  between  the  Bloch  functions  corres- 
ponding to  propagation  constants  k and  k',  is  merely  V^/N,  the  same  for  all  pairs  of 
propagation  constants.  It  is  this  vei’y  simple  form  of  the  matrix  compoiient  which 
leads  to  a very  simple  solution  for  this  problem. 


Let  us  now  set  up  our  solL.tion  as  a series  of  Eloi'h  functions,  ^(k) 


P.  i’-i 


Xhf-*n.  r^mcmb-3rir» 


rt  hV%.-.+  fVt' 


periodic  potential  problem  has  a diagonal  matrix  with  re- 


spect to  ti  e Bloch  functions,  with  matrix  components  E,  , the  equations  for  the  F,  ' = 

K K 


immediate! V become 


Vk  * »«'KvyN)F^,  . EF^ 

\Ve  can  immediately  solve  these  equation,  in  the  fui  m 


(2.  51) 


p 
■ k 


(V^/N)  ^(k-)  F^, 
e-Te-,; 


(2.  52) 


Since  2.,(k')  F^,  is  a constant  independent  of  k,  this  at  once  tells  us  that  Fj^  is  a con- 
stant divided  by  E - Ej^.-  which  gives  complete  solution  of  the  problem  of  determining 
the  wave  funr  tion;  the  constant  can  be  determined  by  the  normalization  condition.  We 
can  at  onre  find  an  equation  for  the  energy.  Let  (V^/N)  ^(k')  Fj^,  = A.  Then  from 
(2.  52),  Fj^  = A/(E  - E|^).  We  substitute  this  In  the  definition  of  A above,  and  find 
that 


(V  /N)  = 1 (2.53) 

o E - Ej^ 

Here  V^,  we  remember,  is  the  perturbative  potential  on  the  atom  at  the  origin,  N the 
number  of  atoms  in  the  repeating  region,  the  the  energies  of  the  N energy 


-46- 


6.  GENERAL  METHOD  FOR  SOLVING  THE  DIFFERENCE  EQUATIONS 


levels  in  the  unperturbed  energy  band.  Eq.  (2.  53)  then  is  an  equation  for  E,  determin- 
ing the  energies  of  the  perturbed  states  as  a function  of  the  perturbative  potential 

This  almost  ridiculously  simpie  result  is  ttie  compieLe  solution  for  the  energy 
levels  uf  the  pertuihed  problem,  when  our  conditions  are  satisfied,  of  a single  non- 
degenerate band,  and  a perturbing  potential  only  on  a single  atom.  It  includes  the 
cases  of  interaction  with  neighbors  at  any  distance,  including  the  two-  and  three- 
dimensional  problems.  We  shall  show  in  a moment  that  it  reduces  to  our  earlier  so- 
lutions in  the  cases  taken  up  before.  Before  doing  this,  however,  let  us  consider  the 
general  nature  of  Eq.  (2.  53),  and  make  some  deductions  about  the  energy  levels.  The 
function  ^(k)  I / (E  - Ej^)  becomes  infinite  when  E equals  any  one  of  ihe  Its 

reciprocal,  which  according  to  (2.  53)  equals  V^/N,  becomes  zero  at  each  of  the  Ej^/s, 
so  that  if  is  zero,  the  solutions  are  just  the  Ej^'s.  Between  each  pair  of  zeros  of 
this  reciprocal  function,  there  will  be  a simple  pole.  Thus  as  goes  from  zero  to 
positive  or  negative  infinity,  each  of  the  enei'gy  levels  will  be  displaced,  but  not  fur- 
ther than  the  nexl  adjacent  Ej^.  The  cnccpticnal  come  if  we  are  looking  for  a 

solution  outside  the  unperturbed  energy  band.  As  the  energy  gets  well  outside  the  un- 
perturbed band,  we  may  approximate  eacli  term  E - E.  by  E - <o{0),  where  <o  (0)  is 
the  average  enei'gy  in  ■'he  band,  '.fhus  the  summation  will  be  approximately  N/{E  - 
<y(0)),  so  that  (2.  53)  will  become  approximately  E = (S’ {O)  + V^.  But  this  betiavior 
is  jusi  what  we  found  in  the  specific  cases  we  vvorked  out  earlier,  the  discrete  state 
being  pushed  out  of  the  band,  the  remaining  states  of  the  band  being  only  slightly  per- 
turbed. We  now  see  that  this  behavior  is  perfectly  general.  Eq.  (2.  53),  giving  as 
a function  of  E,  or  E as  a furiclion  of  V , is  thus  the  general  equation  of  curves  like 
those  of  Figs.  2 and  4. 

To  show  the  power  of  our  method,  we  shall  use  it  to  discuss  the  one -dimen- 
sional chain  with  nearest-neighbor  interaction,  -and  then  the  three-dimensional  simple 
cubic  case,  which  we  have  taken  up  earlier.  For  the  one -dimensional  case,  we  know 
that  = <^(0)  + ZS(\)  cos  kR.  We  must  then  evaluate.-  the  sum  ^(kj  l/(E  - Ej^)  in 
{2.  53).  In  case  we  are  dealing  with  the  discrete  state,  so  that  E lies  outside  the  un- 
perturbed band,  we  can  convert  this  sum  into  an  integral  without  trouble.  We  find  that 
kR  goe.''-  by  equal  steps  from  - it  to  w,  or  from  0 to  2tt,  there  being  N intervals.  That 

is,  we  can  replace  ^(k)  by  (N/2u)  j ^d(kR).  Thus  (2.  53)  reduces  to 

*'  o 


-2it 

(V^/N){N/2Tr)  j (E  - dS  {O)  - 2 £5"  (l ) cos  a)'  ' do 

' o 


= 1. 


This  can  be  integrated  by  elem.e.ntary  means,  and  gives 


i . 


(2.  54) 


iZ..  55) 
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When  we  solve  (Z.  55)  fu/'  the  energy  E.  the  result  is  identical  with  Eq,  (Z.  19),  which 

we  obtained  for  the  discrete  state  by  our  straightforward  solution  of  the  problem. 

Simiiai'ly  we  can  take  the  thr oc -dimenai ,.nn!  simple  cubic  lattice.  Here  = 

<5’(0)  T 2<^(i)(cos  k.H  + cos  k^H  + t.os  k^R),  as  we  found  in  Section  3,  wlicre  kj,  k^. 

are  the  three  components  of  the  vector  k.  Just  as  before,  we  can  convert  the  sum 

into  an  integral,  replacing  ^(k^b^  (N/tZir)^)  d'k  , R)  d{k^H)  f d(k.^li).  Whan 
, ^ Jo  -Jo  ^ J o 

we  substitute  lii  '2.  53),  we  then  have 

, -X  /*2ir  fZv  /"ZiT  r , ,1  ■ 1 

V^/{2tt)  j j I E - <i'{0)  - 2<^(l)(cos  a,  + co'  t cos  c.^)| 


(2.  56) 


UQ  J da^  da.^  - 1 , 


just  analogous  to  (2.  54).  W'e  can  integrate  once,  over  a,,  without  trouble,  obtaining 


,2it  -Z-rr 


V /(Ztt)^  / / l/a/|E  - <f(0)  - 2<f(l)(cos  r cos  a^)]  ^ - (2<^’(l))' 

' o ’ o 


da  J da^  - 1 • 


(2.  57) 


But  this  equati  -n  can  be  at  once  converted  to  Eq.  (2.  35),  which  we  obtained  by  our 
earlier  method,  so  that  we  are  led  to  exactly  the  same  situation  which  we  have  already 
discussed.  The  po'  er  of  the  present  method  is  shown  by  the  direct  way  in  which  we 
have  been  able  to  get  at  this  result  for  the  energy;  and  also  in  the  fact  that  the  form 
(2.  56)  for  the  condition  exhibits  directly  the  symmetry  between  x,  y,  and  z wliich.  was 
lacking  in  our  earlier  treatment,  and  is  lacking  in  (2.  57),  in  which  we  have  already 
integrated  once. 

Even  in  cases  where  the  integration  cannot  conveniently  be  carried  out,  Eq. 

(2.  53)  can  still  give  useful  results,  by  using  it  directly  as  a sum.  Thus,  for  instance, 
let  us  consider  the  one-dimensional  chain  with  second-nearest-neighbor  interactions, 
as  in  Eq,  (2.  22).  We  do  not  get  a convenient  result  if  we  convert  )2.  53)  into  an  inte- 
gral. However,  we  may  apply  periodic  boundary  conditions  with  a fairly  smab.  value 
of  N,  the  number  of  atoms  in  the  repeating  region,  just  as  we  did  in  constructing  Figs, 
i and  2,  v/here  we  took  N = 6 for  the  case  of  nearest-neighbor  interactions.  3^hen  the 
N values  can  be  at  once  computed,  and  (2.  53)  immediately  gives  us  as  a function 
of  E,  in  an  algebraic  form  w'hich  can  be  conveniently  handled.  In  a similar  way,  even 
in  three  dimensions,  we  could  get  nume>".cai  values  by  using  jjeriodic  ’ooundary  condi- 
tions with  small  N,  dividing  our  Brillouin  zone  into  a finite  network  of  points,  and 
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findinj;  tiie  E|^'s  for  these  poims.  In  iius  way  we  get  an  t-Aaci  soiuiioii  fui'  liie  problem 
of  sn.sH  N,  and  wc  may  reasonably  assume  that  if  N is  chosen  fairly  large  the  results 
will  approximate  those  for  a very  large  N,  such  as  we  should  use  for  really  accurate 
results 

Our  methods  allow  t.s  to  approximate  the  wave  functions,  as  well  as  tVie  en- 
ergy. We  know  that  the  wave  function  can  be  written  as  ^(k)  Fj^^Bj^(r),  where  F.^  - 
A/(E  - is  a Bloch  function,  and  A is  a constant,  which  can  be  determined  by 

normalizing  the  wave  function.  We  are  more  intere.sted  in  finding,  not  the  coefficients 
of  the  Bloch  functions,  but  the  coefficients  of  the  Wannier  functions,  since  we  know 
from  our  earlier  treatment  in  this  chapter  that  these  coefficients  will  have  the  general 
nature  of  a continuous  function  of  atomic  position.  Let  us  then  substitute  for  the  Bluoh 
functions  in  terms  of  the  Wannier  functions,  from  Eo-  (2.49).  When  we  do  this,  we 
find  that  the  wave  function  equals 


AN-  V(K^)  l^(iT)  a{r  - K.).  (2.  t>8) 

In  other  w'ords,  the  coefficient  of  the  Wannier  function  on  the  atom  at  R^,  the  quantity 
called  U{R.)  in  our  earlier  work,  is 


U(R.) 


AN 


1/2 


Z(k) 


E - Ek 


(2.  H9) 


This  simple  result  is  the  complete  solution  of  the  problem  of  finding  the  wave  function 
in  the  presence  of  an  impurity  producing  a perturbation  only  on  one  atom.  It  is  to  be 
understood,  of  course,  that  the  E which  appears  in  (2.  59)  is  that  which  results  from 
solution  of  Eq.  (2.  53),  which  is  essentially  the  secular  equation.  It  is  interesting  to 
note  that  since  Ej^  is  a periodic  function  of  position  in  k space,  we  can  make  a Fourier 
expansion  of  the  function  l/(E  - Ej^).  the  coefficients  being  associated  with  the  various 
points  R.  of  an  ordinary  space.  It  is  clear  from  (2.  59)  that  the  U(R.)'s  are,  except  for 
a constant  factor,  just  these  Fourier  coefficients.  We  can  now  show  by  direct  integra- 
tion in  simple  cases,  such  as  the  linear  chain  with  nearest -neighbor  interactions,  that 
(2.  59)  leads  to  the  same  re.sults  already  found  for  the  wave  function. 

In  simple  cases,  we  can  use  Eq.  (2.  59)  to  obtain  results  about  the  wave  func- 
tion in  an  analytic  way.  Thus,  let  us  consider  the  problem  of  a three-dimensional  lat- 
tice, in  which  the  c-nergy  E,  can  be  expanded  about  a minimum  or  maximum  energy, 

which  we  take  to  come  at  k = 0,  in  a power  series  starling  wiih  the  terms  a(k  + 

2 2 2 2 ^ 
k + k ),  where  a - h /8w  m,  m beine  an  effective  mass.  Lei  us  have  a perturba- 

tion  at  the  atom  at  the  origin,  such  that  the  discrete  level  lies  at  an  energy  E,  below 
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tlie  bottom  of  the  band  (and  hence  negative,  if  the  bottom  of  Ihe  band  is  a.t  vf-rr!  energy) 

Then  E E,  ca*^:  be  approximated  by  E - a(k  ^ + k ^ + k ^),  which  is  numerically  small 
K ■ X y z 

near  k = 0,  but  increases  rapidly  as  we  go  av/ay  from  this  point.  The  significant  con- 
tributions to  the  sum.  or  integral  in  (2.  59)  will  then  come  from  small  values  of  k,  and 
we  shall  not  make  a serious  error  if  we  integrate,  not  merely  over  the  unit  cell  in  k 
space,  but  out  to  infinity.  The  quantity  l/(E  - Ej^)  will  depend  only  on  tlie  magri.tude 
uf  k.  to  the  approximation  we  are  using,  so  that  in  carrying  out  the  integration  over  k, 
we  may  first  average  ovei-  angles,  in  which  case  the  exponential  e‘"  can  be  re- 

placed by  the  spherical  Bessel  function  sin  kR^/(kH^),  where  here  k and  R,  stand  for 
the  magnitudes  of  the  vectors.  Thus  the  sum  in  (2.  59)  becomes  pr'opo”Conal  to 


^ 2Trk  sin  kR. 

/ ^ 

•'o  (E  - ak“)  kR. 


dk 


tt“  - vRj 


(2.  60) 


where  y = (-E/a)^^^  = (2ir/h)(  2mE)^'^‘^.  But  this  simply  tells  us  that  the  quantity 
U(R.)  is  proportional  to  e as  we  deduced  for  tins  case  in  Section  3,  and  the 

value  of  y W'e  have  found  here  is  the  same  one  found  there. 

We  can,  in  other  woi'ds,  reproduce  the  results  of  the  free  electron  approxima- 
tion for  the  wave  function  very  easily;  but  at  the  same  time  we  have  the  machinery  for 
easily  improving  the  approximation.  We  merely  hav't  to  investigate  the  change  in  the 
■Slim  or  integral  in  (2.  59)  v/hen  we  take  a better  approximation  to  the  energy  Ej^.  In  the 
limit  as  we  have  a very  lai'ge  perturbation,  so  that  the  energy  E departs  widely  from 
the  energy  band,  the  constant  term  in  the  Fourier  represeniaiion  will  outweigh  any 
other  terms;  this  means  that  U{R^  is  much  larger  for  = 0,  or  on  the  perturbing 
atom,  than  on  any  other  lattice  site,  as  we  know  should  be  the  case  fi'om  our  earlier 
qualitative  discussion.  But  it  would  b-c  an  easy  thing,  with  any  given  function  £j, , to 
obtain  numerical  values  for  the  other  Fourier  components,  or  other  U(R.)'s,  to  any  de- 
sired degree  of  approximation.  By  Eqii.  (2.  53)  and  (2.  59),  Wv-,  have  reduced  the  prob- 
lem of  any  lattice  with  a single  perturbing  atom  to  quadratures,  ami  as  more  examples 
are  worked  out,  the  wave  functions  and  energy  levels  of  the  discrete  .states  in  such 
problems  can  be  completely  investigated. 

Next  we  shall  consider  the  more  complicated  case  where  the  perturbaTve  po- 
tential is  felt  over  a number  of  atoms,  and  where  a number  of  overlapping  bands,  with 
their  corresponding  Wannier  functions,  are  concerned.  Before  carrying  through  the 
analytical  treatment  of  this  case,  we  shall  state  the  results  in  words.  We  find  that  the 
solution  can  be  written  as  a linear  combination  oi  functions  of  the  type  we  have  just 
been  considering,  ai  ising  f^'orn  perturbations  on  single  atums.  One  such  function  is  to 
be  centered  on  each  atom  on  which  there  is  a perturbative  potential,  and  one  is  to  be 
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oonstruc+pd  from  the  Wannier  function  of  each,  band  coiicerned.  The  number  of  coeffi- 
cients of  these  functions,  then,  will  equal  the  number  of  atoms  on  which  tV>c)-c  is  per- 
t>!,rbative  potential,  multiplied  by  the  number  of  Wannier  functions  concerned  or,  each 
atom*.  Those  coefficients  are  determined  by  a set  of  simultaneous  homogeneous  linear 
equations,  which  are  incompatible  unless  the  determinant  of  coefficients  vanishes. 

This  furnishes  a secular  equation  for  the  energy,  though  it  is  not  of  the  usual  type  of 
secular  equation.  Ail  the  various  funclious  which  ui'o  superposed  to  find  the  final  solu- 
tion are  to  be  computed  for  the  energy  determined  from  this  secular  equation.  Con- 
sideration of  symmetry  properties  uf  the  i esuliic.g  solution  v>ill  oj.dhia.rily  lead  to  re  - 
lations betwi  m the  coefficients,  reducing  greatly  the  order  of  the  secuiai  equation  to 
be  solved;  in  some  cases  it  can  remove  entirely  ihe  requirement  of  solving  a secular 
equation,  and  lead  directly  to  the  final  wave  function. 

The  statements  which  we  have  already  made  about  the  ivave  function  can  lead 


to  useful  information.  Tlius,  ioi-  the  discrete  state,  ihe 


jiem  with  a si.ngle  atom  on 


which  there  is  perturbatirni  leads  to  a wave  function  falling  off  exponentially  with  the 
distance,  or  perhaps  as  e ^'"/r.  If  we  superpose  such  functions  on  a number  of  neigh- 
boring atoms,  then  no  matter  w.hat  may  be  the  coefficients,  we  shall  still  find  that  at  a 
uLStanc;e  from  these  perturbed  atoms,  the  total  wave  function  will  fall  off  in  much  the 
same  way  as  if  there  were  perturbation  on  only  one  atomi.  Superposition  of  such  func- 
tions lo  give  a resulting  wave  function  having  the  symmetry  behavior  suggested  by  vari- 
ous spherical  harmonics  of  angle  will  then  give  functions  behaving  in  general  like 
spher  ical  harmonics  of  angle  umes  functions  of  angle  which,  like  e / r,  can  be  writ- 
ten in  terms  of  spherical  Bessel  functions  of  unaginary  argument.  Consic'craticr.s  like 
this  can  lead  to  a good  deal  of  information  about  the  wave  functions  of  the  discrete 
states,  even  without  the  need  of  solving  the  pi  u'ulem  completely. 

Let  us  now  see  how  to  verify  the  statements  which  we  liave  made.  I.n  the  first 
place,  we  generalize  our  earlier  work  by  including  several  bands.  We  give  the  Wan- 
nier functions  a (r  - K.)  a subscript  n referring  to  the  band  we  arc-  dealing  with,  and 
n 1 

similarly  for  the  Bloch  function  B^  j^(i- ),  which  i.s  still  defined  by  (2.  49).  The  matrix 
componenl  of  pert-urbativc  cnei-gy  between  Bloch  functions  B_  ,-:^(r ) and  B_^, 


N‘  ^ Rj)  ^ n-^^i’  ^3^' 


(2.61) 


where  *^*^'*^  matrix  component  of  perturbative  energy  between  the  Wan- 

nier function  of  the  n^^  band  on  atomic  site  at  R.,  and  that  of  the  band  n'  at  R . Then 
as  before  we  set  up  a solution  consisting  of  a sum  of  Bloch  function,  J^(n,  k ) ^ 

B^  j^(r  ).  The  equations  for  the  F's  are 

E rE  r*  + 7(n',k'„  R..  R.)  N"  ^ V „,(R. . R ) F^,  = EF 

n,  k n,  k ru'  ’ ■ i’  j'  n,  n"'  i j'  n',K'  n,  k 


(2.  62) 
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lei 


>'(rv.  IT'.  R ) N'  ^ ■ "jT  ,(R,.  R J K , = A 3- 

• j'  1 n'.k'  n,  R. 

Then  we  ean  rewrite  Eq.  (2.  62)  in  llie  forni 


(E  - E r^)  F - = > (R.)  e"’*^  ■ A 5 , 
i'i,  K ' n,  k r 11,  R. 


whose  solution  at  once  is 


F iT  ==  y(R.)  A ^ 

n,  k ^ V n,  R. 


- ik  • Rj 


1 E - E 
1 n,  k 


(2.6  3) 


(2.  64) 


(2.65) 


We  can  now  substitute  Eq,  (2.  65)  in  the  left  side  of  (2.  63),  obtaining 


'n,  k ’ 


- 6 


6 

n,  n*  °K..  R, 

1 i 


}.o. 


{2 . 66) 


From  Eq  (2.  63),  we  can  see  at  once  that  the  only  non-vanishing  K . 

tliose  for  which  one  or  more  of  the  V ,(R  , R )'s  are  different  from  zero;  that  is, 

n,  n'  1 j 

those  referring  to  atomic  sites,  and  Wau:  ier  f inctions,  for  which  there  is  a diagonal 
or  non-diagonai  matrix  component  of  the  perturbative  energy  to  some  other  atomic  site 
and  Wannier  function.  In  our  earlier  case,  there  is  only  one  such  site  and  Wannier 
function,  so  that  we  niel  only  one  coefficient  A,  relating  to  the  atomic  site  at  the  origin. 
If  only  a finite  number  of  atomic  sites  and  enei-gy  bands  are  concei  ned,  we  shall  have 
a corresponding  finite  number  of  non-vanishing  A^  r.'“’  Then  in  Eq.  (2.  66)  we  have 
a set  of  simultaneous  linear  homogeneous  equations  for  these  A's,  and  as  usual  we  can- 
not get  solutions  unless  the  determinant  of  coefficients  vanishes.  This  furnishes  us 
with  a secular  equation  for  the  energy  E,  though  it  is  not  of  the  usual  form  met  in  quan- 
tum mechanics,  since  the  energy  E appears  in  the  denominators  rather  than  in  the  usual 
w'ay.  Once  this  secular  equation  is  solved,  so  that  we  know  the  energy,  we  can  then 
solve  Eqs.  (2.  66)  to  find  the  A's,  and  then  substitute  in  (2.  65)  to  get  the  wave  functions. 

As  in  our  earlier  simpler  case,  it  is  easier  to  interpret  the  result  if  we  con- 
sider, not  the  coefficients  F of  the  Bloch  functions  in  the  expansion  of  the  final 
wave  function,  but  the  corresponding  coefficients  U^(R.)  in  the  expansion  of  the  wave 
.''unction  in  terms  of  Wannier  funciimi^.  If  we  proceed  as  before,  using  Eq.  (2.49),  v/e 
find  at  once  that 
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U (U.) 
n 1 


^ iN 

n,  K . 

J 


1/2 


r'-i; 


E - E 


/ -»  Z ’ \ 

vt..  O . / 


n,  k 


!/2 


X(k ) N 


E - E 


n,  k 


is  proportional  to  the  solution  of  the  problem  of  a 


perturbed  lattice  with  only  a perturbative  potential  at  the  lattice  site  R^  as  v/e  see  from 
Eq.  (2.59).  Thus  we  verify  oui-  eaiTiei-  statement  that  tiie  whole  solution  of  our  pres- 
ent problem  is  a linear  combination  of  such  solutions,  located  on  all  the  various  lattice 
sites  for  which  there  are  perturbative  potential  terms,  and  corresponaing  to  the  Wannier 


functions  of  all  the  bands  concerned.  The  quantities  A 


n,  x»j 

plying  these  various  solutions,  and  the  equation."-  (2.  66)  are  those  determining  these  co- 
efficients. We  remember  th.at  the  energy  E appearing  in  (2.  67")  must  be  that  deter- 
mined from  the  .secular  equation  found  from  (2.  66).  We  now  note  also  that  the  sums 
appearing  in  this  secular  equation  are  of  the  same  foi  m as  those  appearing  in  (2.  67),  so 
that  as  far  as  mathematical  technique  is  concerned,  the  proolems  all  reduce  lo  the  cal- 
culation of  such  sums,  or  to  the  solution  of  the  problem  of  a perturbation  at  a single 
lattice  point. 

We  thus  have  a formal  solution  for  a very  general  type  of  perturbation,  and 

tnc  power  of  our  method  is  shown  in  the  fact  that  the  resulting  secular  equation  has  an 

order  equal  only  to  the  total  number  of  atoms  on  which  there  is  a perturbative  potential, 

times  t**c  nuiiifeer*  of  ovei'lcippin^  enei'^y  conccxTrisd.  As  we  iieve  mentioneci  esr** 

Her,  symmetry  can  often  lead  to  many  relations  between  the  A ^ 's,  greatly  reducing 

n,  xtj 

the  order  of  the  secular  equation.  The  method  just  described  is  sic  new  that  no  ex- 
amples have  yet  been  worked  out  using  it,  beyond  the  very  simple  ones  which  uave  al- 
ready been  described.  But  it  should  lead  in  time  to  a complete  mastery  of  the  Jlffei  - 
cnce  equation  method  of  dealing  with  perturbed  potentials  in  terms  of  Wannier  func- 
tions, so  that  the  view  expressed  in  earlier  sections,  to  the  effect  that  the  difference 
equation  method  of  dealing  with  such  problems  is  better  than  the  appreximate  differen 
tial  equation  method,  seems  entirely  justified. 

The  method  of  the  present  section,  in  cases  where  we  have  overlapping  bands, 
depends  on  having  "Wannier  functions,  and  energies  E^^  for  the  unperturbed  problem 
which  really  diagonalize  the  energy/  of  that  problem.  Thus,  if  we  are  starting  with  the 
simplified  tight  binding  approximation,  as  discussed  in  earlier  sections,  our  first  step 
in  carrying  through  the  present  analysis  must  be  to  solve  the  periodic  potential  prob- 
lem, and  set  up  the  Wannier  functions  and  energy  levels  by  that  method.  Then  we  can 
proceed  to  apply  the  present  method.  The  details  of  doing  this  have  not  yet  been 
worked  out,  but  the  general  outlines  of  the  method  are  clear. 


are  the  coefficients  multi- 
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7.  AdriiTon.il  Types  of  Perturbative  Potentials 

in  the  pi  eceuing  sections,  we  heve  ueen.  d.scus-ing  the  energy  levels  in  the 
presence  of  a pe»'turbation  of  the  type  which  would  be  pr(;duced  by  an  impurity  atom 
subsliiuted  for  cr.c  of  the  atoms  of  the  crystal.  Most  of  our  analysis,  however,  is  more 
general  than  this,  and  in  the  present  section  we  shall  mention  a number  of  other  types 
of  perturbation  which  ai'e  important.  Some  extensions  are  obvious.  F'or  instance, 
there  is  the  well-known  problem  of  the  F center  in  an  alkali  halide  crystal.  This 
arises  when  there  is  a missing  halogen  ion  in  the  crystal.  In  such  a case,  the  per- 
turbative potential  is  essentially  the  negative  of  the  potential  around  such  a halogen  ion, 
in  a dielectric  such  as  is  provided  by  the  crystal.  Since  the  halogeri  ion  is  negatively 
charged,  this  is  a positive  potential,  or  negative  potential  energy  for  an  electron,  in 
all  the  region  surrounding  the  missing  ion,  but  a positive  potential  energy  for  the  elec- 
tx-un  in  the  region  from  which  the  ion  is  missing.  The  Couioiub  attraction  surrounding 
the  site  of  the  missing  ion  will  result  in  bound  energy  levels  for  an  electron,  and  the 
lowest  of  these  bound  levels  will  correspond  to  the  energy  level  concerned  iti  rhe  F 
center;  an  electron  becomes  trapped  in  such  an  energy  level,  and  can  be  ejected  to  an 
excited  level  by  absorption  of  energy. 

It  is  now  interesting  to  ask  how  the  wave  function  of  this  lowest  bound  level  is 
to  be  built  up.  We  may,  in  accordance  with  the  preceding  sections,  make  it  up  by  su- 
perposing atomic  orbitals  on  the  various  atoms  of  the  crystal,  wdth  amplitudes  which 
will  rapidly  decrease  as  we  go  away  from  the  missing  ion.  These  amplitudes  v.ill  hr. 
determined  by  our  difference  equation,  which  resembles  a diffeiential  equation  for  a 
Schrddinger  problem  derived  from  the  perturbative  potential  whic'n  is  repulsive  at  the 
position  of  the  missing  ion,  attractive  at  all  .other  atomic  sites,  but  rapidly  decreasing 
in  magnitude  as  we  go  away  from  the  missing  ion.  A reasonable  first  approximation  to 
this  wave  function  is  to  suppose  that  there  is  no  contribution  at  the  site  of  the  missing 
ion,  on  account  of  the  large  repulsive  potential,  but  that  the  main  contributions  come 
on  the  alkali  ions  which  are  the  nearest  neighbors  of  this  missing  ion,  with  rapidly  de- 
creasing contributions  on  more  distant  ions.  It  has  been  shown  by  a number  of  workers^ 
that  we  get  quite  an  acceptable  re.sult  by  merely  superposing  atomic  functions  on  the 
six  alkali  ions  forming  the  nearest  neighbors  of  the  missing  ion.  We  can,  of  course, 
make  six  possible  linear  combinations  of  these  six  atomic  functions,  whose  coefficients 
are  at  once  determined  by  requirements  of  symmetry.  The  sum  of  the  six  gives  a wave 
function  of  the  s-like  *ype  of  symmetry  in  the  cubic  crystal,  and  we  can  in  addition 
form  three  p-like  functions  transforming  according  to  a three-aimensiunal  irreducible 

't.  Muto,  Prog.  Thcoret.  Phys.  4,  243  (1949) 

'V.  Inui  and  Y.  Uemura,  Prog.  THeorct.  Phys.  5,  252,  395  (1950) 

. K.  Kahn  and  C,  Kittel,  Phys.  Rev.  3 1 5 (L)(  1 953). 
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reprt'senialion  ol  the  cubic  space  group,  ana  two  functions  tr ansfoiin . rig  at  coiiilug  n;  a 
tvvo-dimensiona]  representation.  The  fact  that  we  can  get  an  acceptaole  wave  fiinction 
in  this  very  simple  '.vay.  agreeing  with  experiment  in  a number  of  respects,  gives  us 
courage  to  hope  that  in  other  cases  an  impurity  wave  function  consisting  of  atomic  func- 
tions on  a very  linnited  number  of  atoms  surrounding  an  im.purity  atom  may  be  fairly 
satisfactory.  This  is  of  course  a case  with  a large  perturbative  energy,  concentrated 
in  a restricted  neighborhood,  for  which  in  any  case  we  should  expect  a wave  function 
failing  off  rapidly  v/ith  distance. 

There  is  one  feature  met  in  this  F-center  calculation  which  is  also  encountered 
in  any  other  real  problem  of  impurity  atom.s.  The  atoms  in  the  neighborhood  of  the  im- 
purity will  be  subjected  to  stresses,  on  account  of  the  fact  that  the  impurity  atom  will 
be  of  different  size  from  the  atoms  of  th<r  lattice,  aiid  '•■ill  therefore  bo  displaced  slight- 
ly from  their  normal  positions.  Thus,  with  a missing  ion  as  in  an  f’-center,  the  neigh- 
boring atoms  will  tend  to  be  pushed  into  the  hole  left  by  the  missing  ion,  provided  an 
electron  is  occupying  the  impurity  level  to  provide  electrical  ncutralit  •,  otherwise, 
there  v/ould  be  stiung  electrostatic  forces  which  would  tend  to  repel  the  neighboring 
alkali  ions.  The  exact  determination  of  the  resulting  forces  and  displacements  is  a 
very  complicated  problem.  In  principle,  we  should  solve  for  the  electronic  energy  of 
the  crystal,  as  a function  of  the  positions  of  the  neighboring  atoms,  and  vary  the  atomic 
positions  to  minimize  this  energy,  in  order  to  find  the  positions  of  equilibrium  of  all 
aton.'s.  In  practice,  this  is  so  hard  to  do  that  simplified  approximate  methods  at.  ays 
have  to  be  used  to  estimate  the  atomic  displacements. 

Similar  questions  come  up,  and  have  been  considered,  in  problems  of  luminescence, 

where  the  sensitization  of  the  luminescence  of  a crystal  comes  from  the  presence  of  an  im- 

/ 1 \ 

purity  atom.  Thus  Williams,  in  considering  the  luminescence  of  a KCl  crystal  activitated 
■+  + 

by  replacing  a K ionbyaTl  ion,  has  investigated  the  electronic  energy  levels  surrounding 
the  Tl^  ion  as  a function  of  the  position  of  the  neighboring  Cl  ions,  which  are  assumed  capable 
of  moving  radially  in  or  out.  Williams's  calculation  of  the  electronic  energy  levels  does  not 
follow  very  closely  the  model  wliich  we  have  proposed,  but  we  must  notice  that  there  are  fea- 
tures of  our  method  which  are  certainly  not  very  appropriate  in  such  a case.  We  have  tacitly 


CXOOUiXlULi  tllcvv  viic  pc 
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'turbation,  even  at  the  position  of  the  perturbing  ion,  was  small 


enough  so  that  it  was  reasonable  to  use  the  Wannier  functions  of  the  original  atom  to  expand  the 
wave  function,  even  at  the  site  of  the  perturbing  ion.  Thus,  in  this  case,  our  method  if  used 
lit-oraily  would  invclve  building  up  an  impurity  wave  function  which  would  be  made  of  K orbi- 
t.oTs  on  the  T 1^  ion,  as  well  a.=.  Cl  orbitals  on  the  neighboring  chlorines.  This  is  obviously  ab- 
surd, in  such  a case  as  this,  where  the  Tl^  is  very  different  from  the  . It  is  clear  that  the 
spirit  of  our  tight  binding  approximation  would  suggest  in  „'ich  a case  that  we  build  up  a wave 


'f.  E.  Williams,  J.  Chem.  Phys.  19,  45"^  (1951). 
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I'unction  out  ot  T1  atomic  orbitals  on  the  Tl^  ion,  tojjether  with  Cl  orbitals  on  the  Cl  's,  ar.dK 
orbitals  on  the  more  distant  icns  seems  likely,  from  the  discussion  which  we  have  just 
given  of  F-centers,  that  quite  a good  approximation  v.'uald  be  secured  by  combining  a Tl  or- 
bital on  the  Tl^  ions  with  Cl  orbitals  on  the  neighboring  Cl  's,  using  a linear  combination  of 
these  Cl  orbitals  having  the  same  symmetry  as  the  Tl  orbital.  This  is  not  the  method  used  by 
Vv'liiaTnS;  but  suggests  a useful  approach  to  such  problems. 

Another  type  of  problem  which  can  be  h.andled  by  the  type  of  approach  we  have 
suggested  is  tiiat  of  surface  states,  it  was  pointed  out  long  ago  by  Tamm  (references 
to  papers  or  surface  states  are  given  in  the  bibliography  at  the  end  of  this  chapter)  tnat 
if  we  have  a one -dimensional  periodic  potential  problem,  consisting  of  a set  of  poten- 
tial wells,  but  bounded  at  one  end.  and  terminated  by  a region  where  the  petential  is 
constant,  wc  often  have  one  energy  level  split  off  frtnri  the  continuum,  associated  with 
a wave  function  which  falls  off  exponentially  as  we  go  away  in  either  direction  from  the 
point  of  discontinuity  between  the  periodic  region  and  the  constant  potential.  We  can 
set  up  a similar  problem  for  a one -dimensional  chain,  handling  it  by  the  tight -binding 
approximation.  Such  a treatment  was  given  by  Goodwin,  by  a method  equivalent  to 
those  taken  up  in  the  present  chapter.  His  assumption  was  equivalent  to  the  following 
m.odification  of  our  problem  of  Section  2:  we  assume  ]ust  the  same  behavior  as  in 

Eq.  (2.  13)  for  positive  p,  assume  as  in  that  equation  a perturbative  potential  V(C)  on 
the  atom  at  the  origin,  but  assume  that  there  are  no  atoms  for  negative  p,  so  that 
(J(-  I)  is  to  be  set  equal  to  zero.  In  this  case  it  is  easy  to  solve  the  equations,  and  we 
find  a bo’ind  state,  similar  to  that  discussed  in  Section  2,  utider  some  circunistarces. 
This  bound  state  is  only  present  if  V(0)  is  greater  in  magnitude  than  (i),  as  we 
readily  find,  and  if  it  is  present,  its  energy  depends  on  the  magnitude  of  V(0).  Some- 
what .similar,  but  by  no  means  equivalent,  results  have  been  obtained  using  a one- 
dimensional  periodic  petential,  not  only  by  Tamm,  as  mentioned  earlier,  but  by  a 
variety  of  other  writers,  including  Mane,  Shockley,  and  otliers. 

There  have  been  some  attempts  to  correlate  the  wave  functions  so  found,  fall- 
ing off  exponentially  as  we  go  away  from  the  surface,  and  called  surface  states,  with 
observed  energy  levels  on  the  surfaces  of  certain  crystals,  such  as  germanium.  Un- 
doubtedly, in  a general  way,  there  is  such  correlation.  Surely  ihe  surface  atoms  of  a 
crystal  are  in  a sufficiently  different  environment  from  those  in  the  interior  so  that 
they  act  almost  like  different  types  of  atoms,  and  a treatment  based  oii  handling  them 
like  impurity  atoms  will  have  elements  of  truth.  However,  the  real  situation  is  almost 
certainly  so  much  m.ore  complicated  than  the  cases  which,  have  been  treated  theoretically 
that  it  would  be  dangerous  to  draw  any  conclusions  regaiding  real  surface  states  from 
the  existing  liieory.  The  reason  for  this  is  to  a considerable  extent  exper imeniah  the 

‘E.  T.  Goodwin,  Proc.  Cambridge  Phil.  Soc.  35,  22  1 (1939). 
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surfaces  found  on  leai  crystals  are  almost  certainly  quite  different  from  these  as- 
sumed in  the  simple  thcoiTes,  vvlici  c we  mei  eiy  assume  that  we  nave  a periomc  poten- 
tial up  to  the  surface,  and  then  go  into  the  vacuum.  Real  surfaces  are  almost  cer- 
tairdy  contaminated  by  impurity  atoms  very  tightly  held  to  tha  surface  atoms,  and  the 
problem  which  should  be  solved  is  that  cf  such  a composite  surface.  Urdortunately, 


Hrv  not  Irnou/  ti'Uot 
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metrical  arrangcn'ierit  Until  this  is  better  understood,  from  an  experimental  point  of 
view,  the  theory  can  hardly  miake  much  prog'^ess. 

The  sort  of  thing  to  be  expected  can  be  seen  by  considering  the  case  of  ger- 
manium If  we  cleaved  a germanium  crystal  on  a 111  face,  for  example,  we  Cuui.l  find 
a surface  layer  of  atoms,  each  being  held  to  the  atoms  below  it  by  three  of  the  four 
normal  covalent  bonds,  but  with  the  fourth  bond  sticking  out  from  the  surface  in  an  un- 
saturated  state;  or  alternatively  each  surface  atom  could  he  held  to  the  atoms  below 
by  one  bond,  three  being  unsaturated.  We  can  hardly  assume  that  the  surface  would 
remain  in  this  condition,  in  the  presence  of  a gaseous  atmosphere.  Practically  cer- 
tainly the  unsaturated  bonds  would  attract  atoms  or  radicals,  such  for  instance  as  OH 
radicals,  which  would  be  held  very  tightly  to  the  surface;  we  should  expect  a compli- 
cated .surface  chemistry  involving  the  various  materials  which  couki  be  bound  to  a ger- 
manium surface.  In  pr.actice,  there  seems  often  to  be  a rather  extensive  layer  of 
oxide  on  a germanium  surface,  presumably  a result  of  oxygen  diffusing  down  into  the 
crystal  No  doubt  the  resulting  surface  material  will  be  of  complicated  foim.  GeO, 
evicts  in  two  cry.stal  structures,  the  rutile-  str-uct-urc  like  TiO^,  and  the  4uartz  struc- 
ture like  SiO,.  Fi-esumabiv  the  surface  material  will  try  to  simulate  one  or  the  other 
of  these  structures,  but  on  account  of  being  tightly  Joined  to  the  Ge  crystal  below, 
these  oxide  crystals  will  be  distorted,  and  in  practice  ike  rnat.:rial  is  found  to  he  prac- 
tically amorphous,  indicating  that  the  crystals  are  too  small  to  show  structure  by 
electron  diffraction  or  oxhi=r  methods.  With  such  a complicated  surface,  it  is  obvious 
that  no  simplified  treatment  assuming  an  ideal  geometrical  surface  can  possibly  be 
valid. 


When  crystal  surfaces  can  be  produced  wkh  known  impurity  layers  on  them, 
it  may  be  possible  to  set  up  a theory  of  surface  states,  having  some  analogy  to  our 
treatment  of  the  present  chapter,  but  also  analogy  to  the  theory  of  molecular  orbitals 
in  molecules.  It  is  clear  that  our  problem  is  half  chemical,  half  physical.  T he  bonds 
between  the  atoms  will  be  similar  to  covalent  be,  ids  in  molecules,  and  as  such  can  be 
described  fairly  well  by  the  molecular  orbital  method;  but  once  we  are  inside  the  erj's- 
tal  of  germanium  or  other  material,  we  have  a typif  al  solid-state  problem.  For- 
tunately, the  tight -binding  approximation  to  the  energy  band  problem,  as  considered 
in  this  chapter  and  the  preceding  one,  is  entirely  analogous  to  the  LCAO  molecular 
orbital  method,  so  that  it  -will  be  possible  to  join  these  two  treatments  easily.  But  we 
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p.'iint  out  again  that  tVipre  is  very  little  use  in  spnoulating  about  su.ch  problems,  until 
there  is  a lairiy  good  indication  as  to  v.hat  atoms  we  really  have  in  the  surfaces  we  are 
considering. 

Another  type  of  crystal  problem  similar  to  that  of  surface  states  is  t>iat  of  the 
energy  levels  surrounding  a dislocation  An  edge  uioluetition  in  a crystal  may  be 
dfcsei  ibfu  as  Split  or  crack  in  a '••ry.stoj,  endint;  in  a.n  extra  plane  of  atoms  which  fills 
in  the  crack.  In  that  region  where  the  crack  is  of  appreciable  vvidth,  the  atoms  on  the 
two  surfaces  of  the  crack  are  not  in  very  different  situations  from  the  atoms  on  the 
surface  of  a crystal,  and  we  should  expect  that  something  like  surface  states  should 
arise  The  problem  is  obviously  much  more  complicated  than  that  of  a surf  3.CG , tinci 
no  attempts  have  been  made  so  far  to  consider  the  detailed  nature  of  the  energy  levels 
There  is  r.nother  aspect  which  a dislocation  p;  ,_sumably  has  in  common  with  a i^ui  race; 
bonds  are  broken  between  neighboring  atoms,  and  this  results  in  a particularly  favor- 
able position  for  the  adsorption  of  foreign  atoms.  It  seems  altogether  likely  that  im- 
purity atoms  in  crystals  v/ill  tend  for  this  reason  to  diftime  to  dislocations  and  segre- 
gate there,  and  this  possibility  has  occurred  to  many  workers  in  the  field.  In  this 
case,  tVte  wave  functions  concentrated  in  tl-e  neighborhood  will  have  partly  the  charac- 
teristics of  impurity  states,  and  can  be  discussed  by  the  same  general  methods,  ex- 
cept of  course  that  the  geometry  is  much  more  complicated  than  that  arising  from  a 
single  substitutional  atom. 


The  examples  which  we  have  mentioned  will  illustrate  the  breadth  i> 


prob  - 


.ims  which  can,  in  principle,  be  handled  by  the  type  of  discussion  taken  up  in  this 
chapter.  In  every  case,  we  shall  expect  the  same  general  situation,  though  the  de- 
tailed method  of  handling  the  problem  can  differ  in  different  cases,  and  can  be  very 
complicated.  The  atoms  in  the  neighborhood  of  the  disturbance  in  the  cry'stal  would, 
if  treated  by  themselves,  have  one-electron  energy  levels  quite  different  from  those  of 
the  crystal  as  a whole.  If  these  energy  levels  happen  to  lie  in  an  energy  gap  of  the 
crystal,  then  the  wave  function  of  the  composite  system  will  tend  to  be  large  in  the 
neighborhood  of  the  disturbance,  falling  off  exponentially  as  we  go  into  the  rest  of  the 
crystal,  since  the  only  type  of  solution  of  Schrodinger 's  equation  in  the  crystal  having 
this  energy  is  an  exponential  one.  Conversely,  thf"  wave  functions  corresponding  to 
the  energy  bands  in  the  crystal  will  fall  off  as  we  approach  the  lattice  disturbance. 


8.  Periodic  Perturbative  Potentials 

There  is  one  type  of  perturbation  which  is  quite  different  from  the  ones  which 
ws  have  taken  up  so  far,  but  important  enough  to  deserve  special  treatment.  This  is 
the  case  where  the  perturbative  potential  is  a sinusoidal  functi^.-n  of  position  in  tne  lat- 
tice. VVe  should  only  get  a p»oter.tial  which  wa.s  literally  of  this  form  if  we  had  a sinu- 
soidal electromagnetic  wave  in  the  crystal-  If  its  wave  ienglh  was  comparable  with  the 
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interatomic  distance,  we  should  be  dealing  with  x -rays,  and  would  have  to  'reat  the 
problem  by  quantum  electrodynamics.  However,  we  have  a physically  equivalent  prob- 
lem if  we  have  a sinusoidal  wave  of  sound  in  the  crystal.  It  produces  perturbations 
which  are  equivalent  to  a sinusoidal  perturbative  potential,  and  its  effect  on  the  e'ec- 
trons  of  the  iatticc  can  be  treated  by  the  methods  of  this  chapicr  Tins  problcru  u. 
iinooi'iani  uecause  ine  liiei'iiiai  aj^ilaliwn  ,u  a ciqy.ticil  can  be  treated  ciS  a supc rposition 
of  sound  waves,  mostly  of  very  short  wave  lengths.  We  meet  the  effects  of  the  scat- 
tering of  electron  waves  by  such  sound  waves  when  we  study  the  effect  of  temperature 
agitation  on  electrical  conduct  i vity. 

We  shall  find  ihat  the  effect  of  a sinusoidal  perturbative  energy  on  the  motion 
of  electrons  in  a periodic  lattice  is  very  much  like  the  effect  of  such  a perturbation  on 
free  electron<=;  the  resen-ihl  ance  eln«5.->r  than  v.ouUl  be  expected  merely  from  the 
possibility  of  replacing  tae  difference  equation  by  a differential  equation,  /tccordingly 
first  we  shall  remind  the  reader  about  the  effect  of  a sinusoidal  potential  on  the  motion 
of  free  electrons.  Then  we  shall  consider  a sinusoidal  perluibation  in  our  periodic 
potential  problem.  Finally,  'we  shall  take  up  the  case  of  a sound  wave  in  detail,  and 
show  why  it  is  reasonable  that  its  effect  can  be  replaced  by  an  equivalent  periodic  per- 
turbation. 


An  electron  moving  in  a sinusoidal  field  is  of  course  a well-known  problem, 
the  simplest  example  of  a pex  iodic  potential,  and  one  leading  to  the  Mathieu  equation. 

If  the  potential  has  the  form  cos  Kx,  we  can  separate  variables  in  a iiii  ee-dimen- 
sionai  pi  oblei  i,  and  the  motion  will  be  free  as  far  as  y and  z arc  concerned.  The 
Schrodinger  equation  for  the  motion  along  x is  the  Mathieu  equation.  lis  energy  levels 
fall  into  energy  bands,  the  wave  functions  obeying  Bioch’s  theorem.  That  is,  the 
wave  function  can  be  written  as  a sum  of  exponential  functions  ol  loi nn  exp{i(k  + nK).x), 
where  k is  a propagation  constant,  n an  integer  For  the  low'er  energy  levels,  falling 
ill  narrow  energy  bands,  the  coefficients  associated  with  the  exponential  functions  of 
the  type  written  above  will  fall  off  very  slowly  wilh  n,  in  a complicated  way,  which  has 

f t \ 

been  discussed  by  the  writer,  However,  if  is  sufficiently  small,  there  will  be 
(lu  really  narrow  energy  bands,  and  the  situation  is  qc'te  different.  This  is  the  case 
in  which  there  are  no  tigh'.iy  bound  states  in  the  potential  minima,  or  in  wnich  is  a 
very  small  perturbation.  In  such  cases,  the  coefficients  will  fall  off  very  rapidly  with 
n,  only  the  cases  with  n = 0,  t i being  appreciable.  It  is  only  such  cases  tuat  we  shall 
consider;  only  for  such  a small  amplitude  is  the  effect  of  a sinusoidal  perturbative 
potential  simple.  For  iargei’  perturbation.s  bound  slates  can  arise,  and  all  manner  of 
complications. 

For  small  V , the  solution  of  the  Ma.thieu  problem  has  a simple  form.  'I'he 
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coefficients  of  the  terms  with  ri  = t.  1 are  in  V . starting  with 

^ u"' 

terms;  those  with  n = t 2 are  power  series  starting  with  second  power  terms;  and  so 
on.  Stmilarl;y  Uie  energy  is  a pov.er  series  starting  with  a second  power  term.  If 
is  small  enough,  we  can  considet  only  'insar  terms  in  the  w'avs  function,  including 
thus  only  the  terms  with  n = t i,  and  uai:  disregard  the  perturbat;-''!  in  the  en-rrgy, 
sinre  it  is  in  But  this  is  just  the  approximation  made  by  iirst  order  perturbation 

theory.  In  other  words,  such  a first  order  perturbation  is  adequate  for  the  small 
amplitudes  which  we  are  considering.  We  regard  the  wave  function  as  the  sum  of  an 
unperturbed  wave,  a plane  wave,  of  form  exp(ikx),  and  two  perturbed  waves,  of  form 
exp(i(k  t K;x),  with  amplitudes  proportional  to  V^. 

By  perturbation  theory,  we  can  at  once  find  the  amplitudes  of  these  scattered 
waves.  The  matrix  compe-nent  of  the  perturbative  potential  cos  Kx  between  the  two 
wave  functions  exp(ikx)  and  exp{i(k  + K)x)  is  at  onoe  seen  to  be  simply  a constant  (de- 
termined by  normalization)  times  V^.  Thus  by  fii  st  order  perturbation  iheory  we 
know  that  the  coefficient  of  the  function  exp(i(k  - K)x)  is  equal  to  the  non-diagonal  ma- 
trix component  of  energy  between,  this  function  and  the  unperturbed  function  exp(ikx), 
which  is  proportional  to  divined  by  the  energy  ditference  between  these  waves. 

When  we  nut.  Ihe  problem  in  three-dimensional  vector  language,  this  energy  difference 
is  that  between  waves  witli  propagation  vectors  iT and  iTt  K.  In  the  general  case,  this  en- 
ergy difference  is  large.  However,  there  are  special  cases  where  the  difference  is  s.Tiaii, 
or  vanishes.  In  such  a case  the  amplitude  of  the  scattered  wave  becomes  large,  and  in  the 
limit  where  the  difference  vanisnes,  a power  series  expansion  of  the  type  we  have  been  con- 
sidering breaks  down,  and  we  are  not  allowed  to  use  first-order  perturbation  calculations. 
This  case  is  just  thcLt  in  which  the  two  perturbed  waves  are  related  to  the  unpcA'turbed  wave 
by  Bragg'.s  law,  as  if  we  had  x-ray  scattering  by  planes  of  scattering  material  lying  on  the 
wave  fronts  of  the  lunction  V^cos  Kx.  Ir.  suck  ? we  must  use  a solution  consisting  of 

equal  contributions  of  the  direct  and  scattered  waves.  If  we  solve  a secular  equation  with  two 
rows  and  columns  between  these  two  waves,  in  this  neighborhood  of  the  Bragg  scattering,  we 
arrive  at  the  energy  gap  in  the  well-known  way.  In  other  words,  Bragg  scattering  by  the  si- 
nusoidal scattering  potential  introduces  energy  gaps,  just  as  Bragg  scattering  by  the  periodic 
potential  inthe  crystal  does.  But  these  energy  gaps  will  be  found  across  planes  in  k space 
determined  by  the  wave  vector  K of  the  perturbing  wave. 

In  this  sketch,  we  have  seen  the  general  outlines  of  the  theory  of  scattering  of 
free  electrons  by  a small  sinusoidal  potenlial.  Now  we  see  that  the  same  sort  of  thing 
car,  be  carried  through  for  a sinusoidal  perturbing  potential  5uperpos.''d  on  a periodic 
potential.  The  reason  why  the  problem  is  no  more  difficult  is  simply  that  we  are  con- 
tenting ourselves  with  a first  order  perturbation  calculation,  and  this  can  be  carried 
out  just  as  well  in  the  periodic  potential  case  as  with  free  electrons.  We  start  with 
unperturbed  functions  of  the  form,  of  ^(R;)  a(r  - R^),  v'here  the  U(R^)'s  are  of 
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the  exponential  form,  rxp(ik  • H.).  Let  us  assume  that  ,ve  have  a periodic  perturbation, 
such  that  there  a^e  no  non-diagonal  matrix  components  between  Wannier  functions  ::: 


different  bands, 
form.  V cos  K 


r on  difftrent  s.tc 


and  tiic  diagonal  matrix  components  are  oi  the 


R,  Tiien  we  find  at  once  that  there  are  non-vanishing  matrix  enrn- 

een  two  Bloch  fitnoTions  corresnondinij  to 


poiieiits  of  tiic  [jei  tui  bail  VC  potential  Only  b 


exp(ik  • R.)  and  exp(i(k  t K)  ' R.).  As  with  the  free  electron  case,  these  matrix  com- 
ponents  are  simply  normalization  constants  limes  V^.  We  can  then  set  up  a perturbed 
wave  function,  in  which  these  scailered  waves  again  have  amplitudes  proportional  to 
V divided  by  the  difference  between  the  energies  of  the  waves  with  propagation  vec- 
tors  k and  with  k t K.  The  only  difference  between  this  case  and  the  free  electron 
case  is  that  now  we  are  to  compute  the  energies  of  these  waves  using  the  proper  ex- 
pression for  Ej^,  as  determined  by  the  energy  band  theory. 

We  note  that  this  difference  in  the  situation  means  that  the  scattering  in  the  ac- 
tual crystal  will  not  in  general  follow  the  Bragg  law.  The  derivation  of  this  law  is 
based  on  the  assumption  Lhal  the  energy  is  a function  only  of  the  magnitude  of  the 
propagation  vector,  so  that  the  quantiiies  jki  and  j k - Kl  are  equal.  On  the  other 
hand,  with  a different  dependence  of  the  energy  on  k,  this  will  not  be  the  case.  The 
result  of  the  Bragg  law  is  that  all  waves  of  propagation  vector  k terminating  in  planes 
perpendicular  to  K,  and  distant  by  the  amount  1k/2|  from  the  origin,  will  be  scattered 
strongly  by  the  sinusoidal  potential.  With  a more  general  dependence  of  energy  on  k, 
the  surfaces  on  which  vactors  must  terminate  to  be  scattered  by  this  potential  will  no 
longer  even  be  planes;  they  can  have  complicated  forms.  This  fact  does  not  seem  to 
be  recognized  in  the  literature,  for  the  assumption  that  the  energy  is  a function  of  the 
magnitude  of  k is  generally  made  at  the  very  beginning  of  any  discussion  of  scattering. 

We  can  extend  these  results  without  trouble  to  a considerably  mere  general  case. 
Thus,  let  us  assume  that  we  have  matrix  components  of  the  pertuibative  energy  be- 
tween Wannier  functions  on  different  atoms,  and  connected  with  different  bands;  but 
let  us  still  assume  that  there  is  a sinusoidal  dependence  on  position.  That  is,  let  us 
assume  that  the  matrix  component  of  perturbative  energy  between  Wannier  functions 
a^(r  - R.)  and  a^^r  - it.)  depends  in  an  arbitrary  way  on  n and  m,  and  on  the  vector 

displacement  R.  - R.  between  the  two  atoms,  but  that  it  depends  on  the  absolute  pcsi- 
1 ^ ' K R ■ 

tion  of  the  perturbed  atoms  in  space  only  through  an  exponential  factor  e ’ i,  or  a 

corresponding  cosine  function.  We  shall  still  ^ee,  simply  on  account  of  the  exponential 
dependence,  that  there  will  be  niatrix  components  of  this  perturbative  energy  or.iy  bi-;- 
tween  plane  wave  solutions  having  propagation  constants  k and  k t K.  The  matrix 
component  will  depend  linearly  on  the  various  non-diagonal  as  well  as  diagonal  matrix 
components  of  the  perturbative  energy,  in  a way  which  can  be  easily  expressed  mathe- 
matically. There  are  then  only  two  differences  between  this  case  and  the  earlier  one: 
(l)  the  formula  for  non -diagonal  matrix  component  of  energy  between  the  incident  and 
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3C';ittered  wave  will  be  more  comolicated  than  before;  and  (2)  Ihex-e  will  be  matrix 

components  of  energy  between  an  incident  wave  set  up  from  the  'A'annier  fr notions 

of  one  band,  and  a scaliered  wave  set  up  frc/m  Wannier  functions  a of  another  band. 

^ m. 

It  could  well  happen  that  a wave  of  propagation  vector  k in  one  b.-ind  could  have  the 
sarnc  ."r.crgy  as  one  of  prop.agation  vector  k t K in  anot'ner  band.  Thus  there  could  be 
circumstances  where  the  periodic  potential  perturbation  could  result  in  strong  scat- 
tering of  electrons  from  one  band  to  anothei , provided  the  bands  overlapped  in  energy. 

We  have  now  examined  the  effect  of  a periodic  perturbation  on  the  wave  func- 
tion; next  let  us  ask  why  a .sound  wave  will  produce  such  a perturbation.  We  assume 
that  this  sound  wave  is,  so  to  ?p_uk,  frozen  into  the  crystal;  that  is.  we  have  displace- 
ments of  the  atoms,  varying  sinusoidally  through  the  crystal,  but  independent  of  time. 
We  shall  later  consider  the  time  dependence  which  would  be  present  in  an  actual  sound 
wave.  Wc  mu.st  now  consider  the  perturbing  potential  arising  from  the  sinusoidal 
wave.  This  perturbing  potential  will  be  tlie  difference  between  the  potential  of  a self- 
consistent  field  f.)r  the  undistorted  crystal,  and  for  the  crystal  with  the  sound  wave  in 
it.  It  will  have  quite  a different  form  in  those  parts  of  tne  crystal  where  the  sound 
wave  is  producing  a compression  or  rai’efaction,  and  in  the  other  parts  of  the  ci’ystai 
-../here  the  atoms  are  aispiaced  by  a maximum  amount,  but  without  change  of  density. 
Let  us  fii'st  consider  ti’e  regions  of  maximum  strain,  where  there  is  change  of  density. 

In  a region  of  compression,  for  instance,  each  atom  will  find  its  neighbors 
closer  to  itself  than  in  the  undistorted  crystal.  The  po  Ltri'itictl  of  s.n  clscti'on  al- 

ways tends  to  be  depress'.cd  by  the  prese.’fe  of  neighbors;  consequently,  in  such  a 
region,  there  will  be  a net  decrease  of  potential  energy,  leil  ai  ail  atoms  in  the  neigh- 
borhood. This,  will  result  in  a modification  of  the  diagonal  and  non  — diag^inal  matrix 
components  of  energy  between  the  various  atomic  orbitals,  if  we  are  using  a light- 
binding  aopi  oximaticn.  It  is  such  a modification  as  would  be  produced  by  a compres- 
sion of  the  whole  crystal.  The  modification  of  the  diagonal  matrix  components  would 
result,  if  extended  through  the  crystal,  in  a displacement  of  the  band  as  a whole,  and 
the  modification  of  the  non-diagonal  matrix  components  of  energy  would  result  in 
changing  the  widths  of  the  bands.  Such  changes  of  energy  bands  are  known  to  arise 
from,  the  compression  of  a iatlice,  and  the  effect  of  the  deformation  on  the  position  of 
thp  pnergy  harid.s  has  been  described  by  Bardeen  and  Shock.ley^'^  as  a deformation 
poteniial.  We  see  that  if  we  look  a itlle  more  carefully,  we  sh.-ill  not  have  just  ■ync 
such  potential,  but  rather  shall  have  terms  in  all  the  diagonal  and  non-diagonal  matrix 
components  of  perturbative  energy  between  the  various  atomic  orbitals  or  W'annler 


J.  Bardeen  and  W.  Shockley,  Phys.  Rev.  72  (1950).  For  a discussion  of  this 
and  related  topics  in  the  effect  of  a sinusoidal  perturbative  poteniial,  see 
W.  Shockley,  Holes  and  FJlecirons  in  Electrons  in  Semiconductors  (John  Wiley 
and  Sons,  Inc. ) 1951. 
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L't  ■ , ur  isii'.^  frorn  df^fftrrnn.f 

These  inetrix  components  of  pertnrhative  potential  will,  of  course,  be  pro- 
portional to  the  magnitude  of  the  compressiori,  provided  it  is  small,  and  hence  will 
%ary  sinusoidally  through  the  crystal,  aod  will  have  the  form  we  have  already  con- 
sidered, tak.ng  on  their  maximum  values  at  the  positions  of  maximum  compression 
and  rarefaction.  In  addition,  there  will  be  other  terms  in  the  matrix  components  of 
perturbative  potential,  taking  on  their  maximum  values  at  Uie  inter  mediate  positions 
where  the  displacement  of  the  atoms  is  a maximum,  but  where  there  is  no  compres- 
sion. To  understand  these  terms,  let  us  ask  bow  we  should  handle  the  problem  which 
would  arise  if  the  whole  crystal  were  to  be  displaced  bodily  by  an  infinitesimal  amount. 
If  we  tried  to  expand  the  perturbed  wave  functions  m terms  of  the  unperturbed  ones 
(located  at  the  original  atomic  sites),  we  should  of  course  merely  end  up  with  wave 
functions  which  were  disniaced  by  tiie  same  amount  as  the  lattice,  and  ’.vit'i  energy 
values  which  were  unchanged.  We  could  not  accomplish  this  perfectly  if  we  were 
merely  using  a finite  set  of  atomic  orbitals,  rather  than  a complete  set  of  Wannier 
functions.  These  matrix  components  of  energy  which  have  their  maximum  values  at 
the  positions  of  greatest  displacement  in  the  lattice  wilt  be  trying  to  achieve  just  this 
result,  of  displacing  the  wave  functions  bodily  without  having  an  effect  on  the  energy; 
they  will  achieve  it  as  well  as  possible,  consistent  with  the  limited  number  of  atomic 
orbitals  or  Wanmer  functions  ordinarily  used. 

All  these  terms,  as  we  see,  will  vary  sinusoidally  with  position  through  the 
crystal,  and  hence  will  fall  into  the  general  scheme  which  W3  have  described.  Hence 
the  effect  of  such  a sound  wave  will  be  to  pi  oduce  scattered  waves,  of  propagation 
vector  k i K;  and  as  we  have  seen,  the  amplitude  of  this  scattered  wave  will  be  large 
only  if  it  has  almost  exactly  the  same  energy  as  the  in<'ident  wave.  As  we  have  seen 
earlier,  vjc  can  well  have  matrix  components  of  the  perturbative  energy  between  wave 
functions  connected  with  different  bands,  so  that  electrons  can  be  scattered  from  one 
band  to  another,  provided  this  is  compatible  with  scattei’ing  without  change  of  energy. 

We  have  been  talking  about  scattering  by  a frozen-in  sound  wave;  but  this  is 
not  what  we  encounter  in  scattei'ing  by  thermal  oscillations,  which  can  be  analyzed 
into  a superposition  of  real  sound  waves,  traveling  through  the  crystal,  and  hence  with 
a potential  depending  on  lime.  This  problem  cannot  be  nandled  so  simply,  and  we 
shall  merely  indicate  the  .nature  of  its  treatment  (see  Ref.  1,  P 62  lor  a good  dis- 
cussion of  this  problem).  It  has  a close  resemhla.nce  to  problems  in  quantum  electro- 
dynarrocs;  the  interaction  of  the  electrons  with  sound  waves  is  not  entirely  different 
fi  um  the  interaction  with  electromagnetic  waves.  In  the  first  place,  we  can  get  a good 
approximation  by  treating  the  perturbation  simply  as  an  external  perturbative  poten- 
tial varying  with  the  time,  using  the  method  of  variation  of  constant.s.  We  then  find 
that,  if  we  start  at  t = 0 with  .a  single  Bloch  soiulion  of  the  electronic  problem,  and  a 
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perturbali Vf  potential  varying  sinusoidally  wi'h  time  with  a frequency  v.  a probability 
appears  oi  transition  to  another  Bloch  wave,  the  amplitude  being  proportional  to  the 
amplitude  of  the  perturbation.  This  scattered  wave  is  the  form  in  which  the  perturbed 
wave  appears,  v,<hen  we  are  using  the  method  of  variation  of  con.3tant.s,  in  which  we 
are  not  netting  up  statio.iary  states. 

If  k is  the  propagation  constant  of  the  initial  Bloch  wave,  ant'  if  the  sound 
wave  has  a propagation  constant  K,  then  we  find,  as  we  should  expect,  that  we  have 
transitions  only  to  Bloch  waves  with  propagation  constants  k j.  K.  The  amplitude  of 
the  scattered  wave  does  not  have  the  difference  between  the  energies  of  the  Bloch 
waves  with  propagation  constants  k and  k t K in  the  denominator,  however,  as  in  the 
static  case-  Instead,  it  has  this  difference  of  energy,  t hv.  In  other  words,  for  a 
large  scattering  amplitude,  the  energy  of  the  scattered  Bloch  wave  must  be  greater 
or  less  than  that  of  the  incident  wave,  by  the  energy  of  a phonon  of  the  appropriate 
sound  wave.  Ordinarily  the  electronic  energy  is  so  large  in  comparison  with  phonon 
energies  that  this  modification  of  the  results  is  unimportant.  However,  Shockley  (see 
Ref.  1,  Page  62)  points  out  certain  cases  in  which  the  phonon  energy  is  important;  it 
obviously  can  be  when  the  electronic  energy  comes  so  close  to  the  bottom  of  an  en- 
ergy band  that  we  are  dealing  with  very  small  electronic  energies. 

As  in  the  case  of  quantum  electrodynamics,  this  method  of  handling  the  tiine- 
depenuent  field  by  the  method  of  variation  of  constants  applied  lo  a puLeulial  perturba- 
tion which  varies  with  time  is  not  completely  adequate.  We  remember  that  in  quantum 
electrodynamics,  =ucu  a melhed  can  give  us  u treatment  of  absorption  and  forced  emis- 
sion, but  not  of  spent. ancous  emissic^.  Tc  treat  that,  we  must  use  the  Uirac  radiation 
theory,  in  which  the  electromagnetic  field  is  treated  as  part  of  the  system,  as  v'ell  as 
the  atoms.  We  then  set  up  an  unperturbed  problem  in  which  the  atoms,  and  the  elec- 
tromagnetic field,  do  not  interact  with  each  other.  We  set  up  their  stationary  state.s, 
which  are  described  by  giving  the  various  quantum  numbers  of  the  atomic  sy'stem,  and 
the  numbers  of  photons  in  the  various  e.lectromagnetic  vibrations.  We  then  introduce 
the  previously  neglected  interaction  terms  as  perturbations,  treat  them  by  variation 
of  constants,  and  find  that  there  are  probabilities  of  transition  in  which  the  atomic 
system  gains  or  loses  energy,  with  compensating  loss  or  gain  of  energy  by  the  elec- 
tromagnetic field,  so  that  the  states  which  perturb  each  other,  and  which  lead  to  the 
transitions,  are  states  in  which  the  whole  energy,  that  of  atoms  and  field,  is  unchanged 
in  the  transition. 

The  situation  is  somewhat  similar  with  our  acoustical  case,  though  not  iden- 
tical. We  treat  the  electronic  .system,  and  the  nuclear  vibrations,  as  independent  to 
a first  approximation,  describing  the  nuclear  vibrations  by  normal  coordinates.  We 
must  here  work  in  the  framewoi  c ol'  the  Born-Oppen.heimer  appi  oximalion,  in  which 
the  electronic  problem  is  solved  for  fixed  positions  of  the  nuclei,  with  an  energy 
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uepeiidlng  parametrically  on  the  nuclear  positions.  This  energy  must  then  be  used  as 
a potential  energy  for  the  nuclear  framework,  describing  its  vibrations.  We  start 
with  an  unperturbed  wave  function  w'hich  is  a product  of  Hermite  functions  of  the  nor- 
mal coordinates  describing  the  nuclear  displacements,  and  an  electronic  function  con- 
taining the  normal  coordinates  p-  nametrically.  Then  we  insert  as  perturbations  the 
term  in  the  HamlAo^'an  which  is  leglected  in  the  Born-Oppenheimer  approximation, 
terms  in  th"'  rmr.c:=  derivai Lv'cs  of  the  wave  function  of  the  electnor.ic  niolion  witti  re- 
spect to  both  electronic  and  nuclear  coordinates.  When  this  perturbative  term  is  in- 
serted, we  find  as  in  the  electromagneitic  case  that  it  introduces  transitions  between 
different  approximate  stationary  stales  of  the  nuclear  and  electronic  parts  of  the  sys- 
tem. As  before,  these  are  tr.ansitions  in  which  the  tot?'  energy,  electronic  plus  nu- 
clear, does  not  change;  so  that  as  with  the  method  of  the  time -dependent  perturbative 
potential,  we  conclude  that  those  transitions  will  occur  in  which  the  eieclron  gains  or 
loses  the  energy  of  a phonon  in  the  scattei  ing  process. 

These  problems,  whose  treatment  we  have  merely  sketched,  are  really  very 
intricate,  and  we  shall  not  treat  them  further  here.  There  is  room  for  a really  com- 
plete treatment  of  sucli  questions,  based  on  the  Wannier  function  method  of  handling 
the  scattering  problem;  such  a treatment  has  not  been  given.  We  meet  these  questions 
in  a practical  way,  of  course,  in  the  problem  of  electrical  conductivity,  where  it  is 
well  known  that  the  scattering  of  the  electron  waves  by  the  lattice  vibrations  is  one  of 
the  main  sources  of  electrical  resistance.  We  realize  how  intricate  the  problem  can 
be,  when  we  recall  that  the  t.ieories  of  superconductivity  of  Bardeen  and  Frohlich  are 
based  on  the  Lreatment  of  the  interaction  between  electronic  motion  and  lattice  vibra- 
tions, and  that  none  of  the  existing  methods  of  handling  that  problem  are  really  satis- 
factory, in  spite  of  the  large  amoiint  of  effort  which  has  been  put  into  the  pi  uuiem. 

9.  A Constant  Accelerating  Field 

As  a final  example  of  perturbative  potentials,  we  take  the  case  of  a constant 
accelerating  field,  which  of  course  is  required  in  any  discussion  of  electrical  con- 
ductivity. In  such  a problem  our  primary  interi?st  is  in  the  time -dependent  problem, 
in  which  the  field  continually  accelerates  an  electron,  rather  than  in  the  stationary 
states.  We  can  handle  this  best  by  the  method  of  wave  packets.  Let  us  consider  an 
electron  in  a given  energy  band,  with  a given  k vec+or,  and  ask  how  it  is  accelerated 
by  the  constant  external  field.  We  first  build  up  a wave  packet,  by  superposing  Bloch 
waves  whose  k's  are  all  near  the  one  we  are  interested  in,  and  all  coming  fiOin  the 
same  enei-gy  band.  This  wave  packet  is  supposed  to  b-j  localized  near  a given  point  of 
space.  We  know,  from  the  uncertainty  principle,  that  complete  localization  is  incom- 
patible  with  using  just  waves  of  nearby  k’s.  In  fact,  the  closest  we  can  come  to  a 
really  localized  function  is  a Wannier  function,  and  this  uses  all  the  k's  in  the  band. 
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If  we  use  k‘s  chosen  from  the  immediate  neighborhood  of  one  point  in  k space,  the 
wave  packet  will  be  much  more  spread  out  than  a Wannier  function.  But  this  is  satis- 
factory foi-  the  present  purpose,  for  the  acceieraiing  fields  met  in  the  ordinary  dis- 
cussion of  conductivity  are  small  enough  so  th.at  the  potential  changes  only  slightly  in 
u distance  cf  many  j attire  spacings. 

When  we  have  set  up  this  wave  packet,  we  ask  how  it  moves  under  the  action 
of  the  accelerating  field.  To  answer  this  question,  v.c  use  the  '.veil -kno'vt-i  them  cm 
stating  that  the  center  of  gravity  of  a wave  packet  moves  according  to  the  classical 
Hamiltonian  equations,  provided  the  wave  function  satisfies  a Schrodinger  equation 
constructed  ii'ori,  the  same  Hami Homan.  We  use  this  theorem  in  connection  with  our 
solution  of  tlie  perturber)  pei  iudic  potential  problem  in  terms  of  Wannier  function,  and 
in  particular  in  connection  v.dth  the  Schrodinger -like  equation  given  in  Eq.  (2.  6).  We 
express  our  wave  packet  by  giving  the  expansion  coefficients  U(r  ) in  terms  of  Wan- 
nier functions,  rathei  than  in  letnis  of  Biocli  functions,  .and  then  use  the  iime -dependent 
form  of  Eq.  (2.  6)  to  determine  the  time  variation  of  the  U's.  Before  doing  this,  we 
must  examine  one  thing:  the  justification  of  using  the  particular  form  of  perturbative 
potential  used  in  Eq.  (2.  b),  in  which  we  have  matrix  components 

perturbative  potential  between  Wannier  functions  only  if  n = m,  and  for  ii  is 

only  in  this  case  that  an  ordinary  Schrodinger  equation  is  appropriate.  The  justifica- 
tion in  iivi-.  case  i.s  that  the  pi^rturbative  potential,  which  is  a linear  function  of  posi- 
tion if  we  are  dealing  with  a cuiistani  external  field,  is  assumed  to  vary  so  slowly  with 
position  that  over  the  region  where  one  Vv'annic-r  i'unction  a^(r  - R^  is  appreciably 
different  from  aero,  the  potential  may  be  treated  as  constant.  T’ne  matrix  component 
V (R  , K.),  which  is  equal  to  ■•■'r  - R ) Va  (r  - R ) dv,  where  V is  the  perturba- 

live  poientiai,  Uien  beconics  equal  to  an  average  value  of  V,  tunes  an  overlap  integral 

between  two  Wannier  functions,  vyhich  is  zero  unless  n = m and  R.  = R..  We  shall  then 

^ 1 J 

have  departures  from  our  case  of  Elq.  (2.  6)  only  for  very  strong  fields;  we  shall  con- 
sider that  case  latei’ 

From  Eq.  (2.6),  we  see  that  U(r)  satisfies  a Schrodinger  equation  with  Ilamii- 
tonian  E(p  ) + V(r),  where  we  have  written  E(k)  in  terms  of  the  equivalent  p = k)(,  where 
this  p is  the  quantity  to  be  converted  into  a differential  operator.  Thus  the  center  of 
mass  of  a wave  packet  will  move  according  to  the  Hamiltonian  equations  der  ived  from 
this  Hamiltonian.  These  Hamiltonian  equations  are 


dp^/dt  = -9V/9x,  dx/dt  = 9E/9p^ 


(2,  68) 


with  similar  equations  for  y and  z.  These  equations  have  a very  simple  meaning.  The 
first  one  means  that  the  center  of  mass  of  the  wave  packet  moves  in  k space  according 
to  simple  classical  mechanics,  the  rime  rt.ie  of  change  o.'  the  pseudemomentum  p 
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(which  we  remember  15  not  really  a momentum)  equaling  the  external  force  9'he 
sc-concl  can  be  easily  shown  to  be  equivalent  to  the  formula  for  group  velocity  ol  the 
'.va'vc  packet.  These  simple  results,  which  we  have  seen  follow  rlg',j-ou.sly  piovided 
Eq.  (Z.6)  IS  valid,  are  the  basis  of  the  ordinary  quantum  mechanical  treatment  of 
elertrir.al  conductivity. 

It  reiTiLiins  only,  then,  to  consider  the  case  of  fields  so  large  that  this  resiilt 

IS  no  longer  coiTect.  For-  a sufficiently  large  field,  the  components  - K^) 

V a (r  - H.)  dv  can  become  ap’preciable,  cunaecting  VVarsnier  fn.nctions  in  two  differ- 
m j ’ ' ° 

ent  bands.  Under  these  mr(:unisl...M.<rS,  the  Schrodir.ger -type  equation  (^i.  6>)  is  no 
longer  appi'opr iate,  and  vve  must  use  iiie  iJifference  equations  (Z.  5),  or  '.rent  the  inter- 
action with  other  bands  by  some  different  method.  The  result  will  be,  if  we  use  the 
time -dependent  treatment,  that  there  v/ill  be  a certain  probability  that  an  electron 
jumps  from  one  band  to  another,  under  the  action  of  the  field.  The  wave  packet,  in 
other  words,  will  not  be  preserved  unchanged,  but  will  gradually  aissipate,  with  re- 
duction of  intensity  in  the  original  band,  and  a building  up  of  intensity  in  the  other 

(1'  - - 

band.  Such  an  effect  was  discussed  many  years  ago  by  Zener,  ' on  the  basis  ol  ele- 
mentary band  theory,  and  is  usually  called  the  Zener  effect.  There  is  good  indication 
that  it  can  be  experimentally  observed,  in  conduction  in  germanium.  ' ' 

We  can  consider  the  Zener  effect  in  an  alternative  way,  which  is  very  in- 
sti’uctive.  Instead  of  asking  about  the  motion  of  wave  packets,  let  us  consider  the 
nature  of  stationary  states  in  the  p esence  of  ihe  constant  accelerating  field,  or  the 
potential  energy  varying  linearly  with  a coordinate,  say  x.  Let  us  .hen  set  up  a figure 


Fig.  7 

Energy  bands  in  presence  of  constant  external  field 

*C.  Zener,  Pr-oc.  Roy.  Soc.  (i.ondon)  Al45,  529  (i934).  Fi.)r  the  application  of 

Wannier  functions  to  this  problem.,  see  Paula  Feuer,  P ys.  Rev.  92  (1952). 

‘‘K.  B.  McAfee,  E.  J.  Ryder,  W.  Shockley  and  M.  Spai'k.^,  Phys.  Rev.  83,  650  (L) 
(1951). 
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like  Fig.  ?,  in  which  the  energy  is  plotted  vertically,  distance  through  the  crystal 
horizontally.  The  energy  bands  arc  shown  displaced  upward  by  an  amount  c-qurd  to 
the  perturbative  potential  V,  which  varies  linearly  with  -x.  ]\inw  we  consitler  a hori- 
zontal line  of  height  E,  as  shown.  This  line  falls  inside  each  energy  band  only  over 
a restricted  region  of  space.  If  we  set  up  a wave  packet  inside  an  energy  baud,  as 
w€  have  been  doing,  we  see  that  we  expect  this  packet  to  oscillate  back  ana  forth 
ihvc.ngh  the  band,  from  one  boundary  to  this  r.c.stricted  region  to  the  other;  it  is  only 
within  this  region  that  the  classical  velocity  of  the  wave  packet,  as  given  by  Eq.  (2.  68). 
will  be  real.  This  packet  would  be  made  up  of  wave  functions  whose  energies  were 
in  the  neighborhood  of  E,  and  they  would  be  oscillatoiy  (that  is,  lJ(r)  would  be  an 
oscillatory  function  of  position)  in  the  restricted  region,  but  would  fall  off  in  a more 
or  less  exponential  way  in  the  energy  gap. 

Now  we  could  set  up  solutions  of  this  type,  either  in  the  form  of  wave  packets 
or  statioec’’y  sf---fe‘=,  each  of  the  two  energy  l.-aiids.  in  ditfcT'en;  regions  of  space. 

It  is  clear  that  if  the  exponential  tails  of  these  two  solutions  did  not  fail  off  rapidly 
enough,  in  the  energy  gap,  the  tails  of  the  two  solutions  would  overlap  slightly,  and 
this  would  result  in  transuions  from  one  band  to  another,  by  a tunnel  effect.  There 
would  be  several  different  ways  to  treat  this.  We  could  set  up  stationary  solutions 
U(r  ) which  held  properly  in  both  bands;  these  would  he  linear  combinations  of  solu- 
tions in  both  regions,  beh.aving  in  a symmetric  or  antisymmetric  way  in  the  region 
between.  Then  we  could  set  up  wave  packets  from  these  stationaiy  solutions,  and 
we  should  find  a r'robability  that  the  packet  would  leak  from  one  band  to  another.  Or 
we  could  handle  it  directly  by  wave  packet  methods.  In  any  case,  we  should  find  that 
electrons  could  leak  across  the  gap  from  one  band  to  another,  jumping  in  the  process 
fiom  one  part  of  the  ci'ystal  to  anot’ner  part  in  which  the  changed  external  potential 
had  brought  the  conduction  band  down  to  the  same  energy  as  the  valence  band  in  the 
other  region.  Of  course,  such  a jump,  by  a tunnel  effect,  is  vei'y  unlikely  if  it  cor- 
responds to  a distance  of  more  than  one  or  two  atomic  diameters,  thus  we  see  ii;- 
tuitiv'ely  that  the  Zener  effect  is  not  going  to  be  important,  unless  the  applied  external 
field  is  so  strong  that  it  corresponds  to  an  energy  difference  of  the  order  of  an  elec- 
tron volt  (comparable  with  the  gap  widths)  in  an  atomic  spacing.  Thi.s  general  approach 
can  be  timde  *he  basis  of  a quantitative  treatment  of  the  Zener  effect  in  terms  of  Wan- 
nier  functions,  but  we  shall  not  go  into  it  here. 
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in  this  bibli-jgr^iphy,  we  have  included  papers  in  the  topics  obviously  related 
to  this  chapter,  such  as  the  theory  of  impurity  levels  and  surface  states  The  bibliog- 
raphy is  considerably  broader'  than  this,  however.  In  spite  of  the  small  amount  of 
discussion  which  v/e  have  given  lo  problems  of  electrical  conductivity,  in  the  discus- 
sion of  scattering  of  electi  on  waves  by  thermal  vibrations,  and  the  acceleration  of 
wave  packets  by  an  impressed  external  field,  it  has  nevertheless  seemed  desiiable  to 
include  the  nanerr.  on  r'onductivity  in  this  bibliography.  It  is  hoped  that  when  this 
series  of  reports  is  finished,  the  combined  bibliographies  will  cover  fairly  completely 
the  s'cbject  of  electronic  structure  ol  sol  ids,  .-.nd  since  v/c  shall  not  touch  on  problems 
cf  conductivity  again,  we  include  these  papers  here.  For  completeness,  we  have  also 
included  the  papers  on  electrical  breakdown  in  insulators,  those  on  superoonuuctivLty, 
and  the  closely  related  ones  on  free  electron  diamagnetism.  Sf  me  other  things  have 
definitely  not  been  included;  ferro-  and  antiferro-  and  paramagnetism,  cohesive  en- 
ergy of  r.olids,  the  theory  of  excitons  and  optical  absorption  by  solids  (except  such  few 
papers  as  bear  very  closely  on  the  topics  of  this  chapter),  thermal  vibrations  of  solids, 
and  various  other  topics.  Most  of  these  things  will  be  included  in  the  bibliography 
which  will  follow  the  final  technical  report  in  the  present  series,  which  will  deal  with 
various  problems  involving  electron  interactions  in  solids.  Also,  v/e  have  not  in- 
cluded the  rapidly  gro-wing  literature  on  the  transistor,  and  on  the  closely  related 
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